An ultra-lightweight detector with high accuracy and speed for aerial images

Lei Yang; Guowu Yuan; Hao Wu; Wenhua Qian

doi:10.3934/mbe.2023621

Mathematical Biosciences and Engineering

2023, Volume 20, Issue 8: 13947-13973. doi: 10.3934/mbe.2023621


1. School of Information Science and Engineering, Yunnan University, Kunming 650504, Yunnan, China
2. Yunnan Key Laboratory of Intelligent Systems and Computing, Kunming 650504, Yunnan, China

Academic Editor: Jorge Bernardino

Received: 07 March 2023 Revised: 17 May 2023 Accepted: 05 June 2023 Published: 20 June 2023

Aerial remote sensing images have complex backgrounds and numerous small targets compared to natural images, so detecting targets in aerial images is more difficult. Resource exploration and urban construction planning need to detect targets quickly and accurately in aerial images. High accuracy is undoubtedly the advantage for detection models in target detection. However, high accuracy often means more complex models with larger computational and parametric quantities. Lightweight models are fast to detect, but detection accuracy is much lower than conventional models. It is challenging to balance the accuracy and speed of the model in remote sensing image detection. In this paper, we proposed a new YOLO model. We incorporated the structures of YOLOX-Nano and slim-neck, then used the SPPF module and SIoU function. In addition, we designed a new upsampling paradigm that combined linear interpolation and attention mechanism, which can effectively improve the model's accuracy. Compared with the original YOLOX-Nano, our model had better accuracy and speed balance while maintaining the model's lightweight. The experimental results showed that our model achieved high accuracy and speed on NWPU VHR-10, RSOD, TGRS-HRRSD and DOTA datasets.

Citation: Lei Yang, Guowu Yuan, Hao Wu, Wenhua Qian. An ultra-lightweight detector with high accuracy and speed for aerial images[J]. Mathematical Biosciences and Engineering, 2023, 20(8): 13947-13973. doi: 10.3934/mbe.2023621


1. Introduction

A problem that occurs frequently in a variety of mathematical contexts is to find the common invariant subspaces of a single matrix or of a set of matrices. In the case of a single endomorphism or matrix, it is relatively easy to find all the invariant subspaces by using the Jordan normal form. Also, some theoretical results are given only for the invariant subspaces of two matrices. However, when there are more than two matrices, the problem becomes much harder, and unexpected invariant subspaces may occur. No systematic method was previously known. In a recent article ^[1], we provided a new algorithm to determine the common invariant subspaces of a single matrix or of a set of matrices systematically.

In the present article we consider a more general version of this problem, that is, we provide two algorithms for simultaneous block triangularization and block diagonalization of sets of matrices. One of the main steps in these first two proposed algorithms consists of finding the common invariant subspaces of matrices using the new method proposed in the recent article ^[1]. It is worth mentioning that an efficient algorithm to explicitly compute a transfer matrix which realizes the simultaneous block diagonalization of unitary matrices whose decomposition in irreducible blocks (common invariant subspaces) is known from elsewhere is given in ^[2]. An application of simultaneous block-diagonalization of normal matrices in quantum theory is presented in ^[3].

In this article we shall be concerned with finite dimensions only. Of course the fact that a single complex matrix can always be put into triangular form follows readily from the Jordan normal form theorem ^[4]. For a set of matrices, Jacobson in ^[5] introduced the notion of a composition series for a collection of matrices. The idea of a composition series for a group is quite familiar. The Jordan-Hölder Theorem ^[4] states that any two composition series of the same group have the same length and the same composition factors (up to permutation). Jacobson in ^[5] characterized the simultaneous block triangularization of a set of matrices by the existence of a chain $\{0\} = V_0 \subset V_1 \subset... \subset V_t = \mathbb{C}^n$ of invariant subspaces with dimension $dim(V_i/V_{i-1}) = n_i$ . Therefore, in the context of a collection of matrices $\Omega = \{A_i \}_{ i = 1}^N$ , the idea is to locate a common invariant subspace $V$ of minimal dimension $d$ of a set of matrices $\Omega$ . Assume $V$ is generated by the (linearly independent) set $\mathcal{B}_1 = \{u_1, u_2, ..., u_d \}$ , and let $\mathcal{B} = \{u_1, u_2, ..., u_d, u_{d+1}, u_{d+2}, ..., u_n\}$ be a basis of $\mathbb{C}^n$ containing $\mathcal{B}_1$ . Upon setting $S = (u_1, u_2, ..., u_d, u_{d+1}, u_{d+2}, ..., u_n)$ , $S^{-1}A_i S$ has the block triangular form

$\begin{equation*} S^{-1}A_i S = \left( {\begin{array}{cc} B_{1,1}^i & B_{1,2}^i \\ 0 & B_{2,2}^i \\ \end{array} } \right), \end{equation*}$

for $i = 1, ..., N$ . Thereafter, one may define a quotient of the ambient vector space, and each of the matrices in the given collection will pass to this quotient. As such, one defines

$\begin{equation*} T_i = B_{2,2}^i = \left( {\begin{array}{*{20}{c}} \textbf{0}_{(n-d)\times d} & \textbf{I}_{n-d} \end{array}} \right)S^{-1}A_i S \left( {\begin{array}{*{20}{c}} \textbf{0}_{d \times (n-d)} \\ \textbf{I}_{n-d} \end{array}} \right). \end{equation*}$

Then one may begin again the process of looking for a common invariant subspace of minimal dimension of a set of matrices $\{T_i \}_{ i = 1}^N$ and iterate the procedure. Since all spaces and matrices are of finite dimension, the procedure must terminate at some point. Again, any two such composition series will be isomorphic. When the various quotients and submatrices are lifted back to the original vector space, one obtains precisely the block-triangular form for the original set of matrices. It is important to find a composition series in the construction in order to make the set of matrices as "block-triangular as possible."
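One step of this reduction can be sketched in Python as follows. This is only a minimal illustration with exact rational arithmetic: the helper names `matmul`, `inverse` and `deflate` are hypothetical, and the $3 \times 3$ matrix is chosen here for illustration (it is not taken from the article). Its columns 1 and 2 sum to $(2, 2, 0)^T$, so the span of $(1, 1, 0)^T$ is an invariant subspace of dimension $d = 1$.

```python
from fractions import Fraction as F

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

def inverse(M):
    """Exact inverse by Gauss-Jordan elimination over the rationals."""
    n = len(M)
    aug = [[F(x) for x in row] + [F(int(i == j)) for j in range(n)]
           for i, row in enumerate(M)]
    for c in range(n):
        p = next(r for r in range(c, n) if aug[r][c] != 0)  # first usable pivot row
        aug[c], aug[p] = aug[p], aug[c]
        pivot = aug[c][c]
        aug[c] = [x / pivot for x in aug[c]]
        for r in range(n):
            if r != c:
                f = aug[r][c]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]

def deflate(A, S, d):
    """Return (S^{-1} A S, B22), where B22 is the lower-right (n-d) x (n-d) block."""
    B = matmul(matmul(inverse(S), A), S)
    return B, [row[d:] for row in B[d:]]

# Illustrative data (an assumption, not from the article): A (1,1,0)^T = 2 (1,1,0)^T.
A = [[1, 1, 1],
     [0, 2, 1],
     [1, -1, 3]]
# Columns of S: the invariant vector (1,1,0) first, then e_2 and e_3 to complete a basis.
S = [[1, 0, 0],
     [1, 1, 0],
     [0, 0, 1]]

B, T = deflate(A, S, d=1)
# B has zeros below its leading 1 x 1 block; T is the matrix passed to the next iteration.
```

Here `T` plays the role of $T_i = B_{2,2}^i$; the search for an invariant subspace itself is not part of this sketch.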

Dubi ^[6] gave an algorithmic approach to simultaneous triangularization of a set of matrices based on the idea of Jacobson in ^[5]. In the case of simultaneous triangularization, it can be understood as the existence of a chain $\{0\} = V_0 \subset V_1 \subset... \subset V_t = \mathbb{C}^n$ of invariant subspaces with dimension $dim(V_i) = i$ . We generalize his study to cover simultaneous block triangularization of a set of matrices. The generalized algorithm depends on the novel algorithm for constructing invariant subspaces of a set of matrices given in the recent article ^[1].

Specht ^[7] (see also ^[8]) proved that if the associative algebra $\mathcal{L}$ generated by a set of matrices $\Omega$ over $\mathbb{C}$ satisfies $\mathcal{L} = \mathcal{L}^{*}$ , then $\Omega$ admits simultaneous block triangularization if and only if it admits simultaneous block diagonalization, in both cases via a unitary matrix. Following a result of Specht, we prove that a set of matrices $\Omega$ admits simultaneous block diagonalization if and only if the set $\Gamma = \Omega \cup \Omega^{*}$ admits simultaneous block triangularization. Finally, an algorithmic approach to simultaneous block diagonalization of a set of matrices based on this fact is proposed.

The latter part of this paper presents an alternate approach for simultaneous block diagonalization of a set of $n \times n$ matrices $\{A_s\}_{ s = 1}^N$ by an invertible matrix that does not require finding the common invariant subspaces. Maehara et al. ^[9] introduced an algorithm for simultaneous block diagonalization of a set of matrices by a unitary matrix based on the existence of a Hermitian commuting matrix. Here, we extend their algorithm to simultaneous block diagonalization of a set of matrices by an invertible matrix based on the existence of a commuting matrix which is not necessarily Hermitian. For example, consider the set of matrices $\Omega = \{A_i \}_{i = 1}^2$ where

$\begin{equation} A_1 = \left( \begin{array}{ccc} 1&0&0\\ 2&2&0 \\ 1&1&1\end{array} \right) , A_2 = \left( \begin{array}{ccc} 0&0&0\\ 2&1&0 \\ 0&1&0\end{array} \right) . \end{equation}$

(1.1)

The only Hermitian matrices commuting with the set $\Omega$ are the real scalar multiples of the identity matrix. Therefore, we cannot apply the proposed algorithm given in ^[9]. However, one can verify that the following non-Hermitian matrix $C$ commutes with all the matrices $\{A_i \}_{ i = 1}^2$ :

$\begin{equation} C = \left( \begin{array}{ccc} 0&0&0\\ 2&1&0 \\ 0&1&0\end{array} \right). \end{equation}$

(1.2)

The matrix $C$ has distinct eigenvalues $\lambda_1 = 0, \lambda_2 = 1$ with algebraic multiplicities $n_1 = 2, n_2 = 1$ , respectively. Moreover, the matrix $C$ is not diagonalizable. Therefore, we cannot construct the eigenvalue decomposition for the matrix $C$ . However, one can decompose the matrix $C$ by its generalized eigenvectors as follows:

$\begin{equation} S^{-1}C S = \left(\begin{array}{ccc} 0&1&0\\ 0&0&0 \\ 0&0&1\end{array} \right) = \left(\begin{array}{cc} 0&1\\ 0&0\\ \end{array}\right) \oplus \left(1\right), \end{equation}$

(1.3)

where

$\begin{equation} S = \left(\begin{array}{ccc} 0&-\frac{1}{2}&0\\ 0&1&1 \\ 1&0&1\end{array} \right). \end{equation}$

(1.4)

Initially, it is noted that the matrices $\{A_i \}_{ i = 1}^2$ can be decomposed into two diagonal blocks by the constructed invertible matrix $S$ where

$\begin{equation} \begin{array}{cc} S^{-1}A_1 S = \left(\begin{array}{cc} 1&\frac{1}{2}\\ 0&1\\ \end{array}\right) \oplus \left(2\right),& S^{-1}A_2 S = \left(\begin{array}{cc} 0&1\\ 0&0\\ \end{array}\right) \oplus \left(1\right). \end{array} \end{equation}$

(1.5)

Then, a new algorithm is developed for simultaneous block diagonalization by an invertible matrix based on the generalized eigenvectors of a commuting matrix. Moreover, a new characterization is presented by proving that the existence of a commuting matrix that possesses at least two distinct eigenvalues is the necessary and sufficient condition to guarantee the simultaneous block diagonalization by an invertible matrix.
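The worked example in (1.1)-(1.5) can be checked with a few lines of exact arithmetic. The sketch below is only a verification aid; the helper names `matmul`, `inverse` and `conjugate` are hypothetical and do not come from the article.

```python
from fractions import Fraction as F

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

def inverse(M):
    """Exact inverse by Gauss-Jordan elimination over the rationals."""
    n = len(M)
    aug = [[F(x) for x in row] + [F(int(i == j)) for j in range(n)]
           for i, row in enumerate(M)]
    for c in range(n):
        p = next(r for r in range(c, n) if aug[r][c] != 0)  # first usable pivot row
        aug[c], aug[p] = aug[p], aug[c]
        pivot = aug[c][c]
        aug[c] = [x / pivot for x in aug[c]]
        for r in range(n):
            if r != c:
                f = aug[r][c]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]

A1 = [[1, 0, 0], [2, 2, 0], [1, 1, 1]]      # (1.1)
A2 = [[0, 0, 0], [2, 1, 0], [0, 1, 0]]      # (1.1)
C = [[0, 0, 0], [2, 1, 0], [0, 1, 0]]       # (1.2)
S = [[0, F(-1, 2), 0], [0, 1, 1], [1, 0, 1]]  # (1.4)

S_inv = inverse(S)

def conjugate(M):
    """Return S^{-1} M S."""
    return matmul(matmul(S_inv, M), S)

commutes = all(matmul(C, A) == matmul(A, C) for A in (A1, A2))
J = conjugate(C)     # Jordan-type form of C, as in (1.3)
B1 = conjugate(A1)   # first matrix in (1.5)
B2 = conjugate(A2)   # second matrix in (1.5)
```

Both `B1` and `B2` have zero entries outside a leading $2 \times 2$ block and a trailing $1 \times 1$ block, which is the block structure claimed in (1.5).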

An outline of the paper is as follows. In Section 2 we review several definitions pertaining to block-triangular and block-diagonal matrices and state several elementary consequences that follow from them. In Section 3, following a result of Specht ^[7] (see also ^[8]), we provide conditions for putting a set of matrices into block-diagonal form simultaneously. Furthermore, we apply the theoretical results to provide two algorithms that enable a collection of matrices to be put into block-triangular form or block-diagonal form simultaneously by a unitary matrix based on the existence of invariant subspaces. In Section 4, a new characterization is presented by proving that the existence of a commuting matrix that possesses at least two distinct eigenvalues is the necessary and sufficient condition to guarantee the simultaneous block diagonalization by an invertible matrix. Furthermore, we apply the theoretical results to provide an algorithm that enables a collection of matrices to be put into block-diagonal form simultaneously by an invertible matrix based on the existence of a commuting matrix. Sections 3 and 4 also provide concrete examples using the symbolic manipulation system Maple.

2. Preliminaries

Let $\Omega$ be a set of $n \times n$ matrices over an algebraically closed field $\mathcal{F}$ , and let $\mathcal{L}$ denote the algebra generated by $\Omega$ over $\mathcal{F}$ . Similarly, let $\Omega^{*}$ be the set of the conjugate transpose of each matrix in $\Omega$ and $\mathcal{L}^{*}$ denote the algebra generated by $\Omega^{*}$ over $\mathcal{F}$ .

Definition 2.1. An $n \times n$ matrix $A$ is given the notation $BT(n_1, ..., n_t)$ provided $A$ is block upper triangular with $t$ square blocks on the diagonal, of sizes $n_1, ..., n_t$ , where $t \geq 2$ and $n_1+... +n_t = n$ . That is, a block upper triangular matrix $A$ has the form

$\begin{equation} {\bf{A}} = \left( {\begin{array}{*{20}c} {{\bf{A}}_{1,1} } & {{\bf{A}}_{1,2} } & \cdots & {{\bf{A}}_{1,t} } \\ 0 & {{\bf{A}}_{2,2} } & \cdots & {{\bf{A}}_{2,t} } \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & {{\bf{A}}_{t,t} } \\ \end{array} } \right) \end{equation}$

(2.1)

where ${\bf{A}}_{i, i}$ is a square matrix of size $n_i \times n_i$ for all $i = 1, ..., t$ , and ${\bf{A}}_{i, j}$ is an $n_i \times n_j$ matrix for $j = i+1, ..., t$ .

Definition 2.2. A set of $n \times n$ matrices $\Omega$ is $BT(n_1, ..., n_t)$ if all of the matrices in $\Omega$ are $BT(n_1, ..., n_t)$ .

Remark 2.3. A set of $n \times n$ matrices $\Omega$ admits a simultaneous triangularization if it is $BT(n_1, ..., n_t)$ with $n_i = 1$ for $i = 1, ..., t$ .

Remark 2.4. A set of $n \times n$ matrices $\Omega$ is $BT(n_1, ..., n_t)$ if and only if the algebra $\mathcal{L}$ generated by $\Omega$ is $BT(n_1, ..., n_t)$ .

Proposition 2.5. ^[7] (see also ^[8]) Let $\Omega$ be a nonempty set of complex $n \times n$ matrices. Then, there is a nonsingular matrix $S$ such that $S \Omega S^{-1}$ is $BT(n_1, ..., n_t)$ if and only if there is a unitary matrix $U$ such that $U \Omega U^{*}$ is $BT(n_1, ..., n_t)$ .

Theorem 2.6. ^[5, Chapter Ⅳ] Let $\Omega$ be a nonempty set of complex $n \times n$ matrices. Then, there is a unitary matrix $U$ such that $U \Omega U^{*}$ is $BT(n_1, ..., n_t)$ if and only if the set $\Omega$ has a chain $\{0\} = V_0 \subset V_1 \subset... \subset V_t = \mathbb{C}^n$ of invariant subspaces with dimension $dim(V_i/V_{i-1}) = n_i$ .

Definition 2.7. An $n \times n$ matrix $A$ is given the notation $BD(n_1, ..., n_t)$ provided $A$ is block diagonal with $t$ square blocks on the diagonal, of sizes $n_1, ..., n_t$ , where $t \geq 2$ , $n_1+... +n_t = n$ , and the blocks off the diagonal are the zero matrices. That is, a block diagonal matrix $A$ has the form

$\begin{equation} {\bf{A}} = \left( {\begin{array}{*{20}c} {{\bf{A}}_1 } & 0 & \cdots & 0 \\ 0 & {{\bf{A}}_2 } & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & {{\bf{A}}_t } \\ \end{array} } \right) \end{equation}$

(2.2)

where ${\bf{A}}_k$ is a square matrix for all $k = 1, ..., t$ . In other words, matrix ${\bf{A}}$ is the direct sum of ${\bf{A}}_1, ..., {\bf{A}}_t$ . It can also be indicated as ${\bf{A}}_1 \oplus {\bf{A}}_2 \oplus ... \oplus {\bf{A}}_t$ .

Definition 2.8. A set of $n \times n$ matrices $\Omega$ is $BD(n_1, ..., n_t)$ if all of the matrices in $\Omega$ are $BD(n_1, ..., n_t)$ .

Remark 2.9. A set of $n \times n$ matrices $\Omega$ admits a simultaneous diagonalization if it is $BD(n_1, ..., n_t)$ with $n_i = 1$ for $i = 1, ..., t$ .

Remark 2.10. A set of $n \times n$ matrices $\Omega$ is $BD(n_1, ..., n_t)$ if and only if the algebra $\mathcal{L}$ generated by $\Omega$ is $BD(n_1, ..., n_t)$ .

Proposition 2.11. ^[7] (see also ^[8]) Let $\Omega$ be a nonempty set of complex $n \times n$ matrices and let $\mathcal{L}$ be the algebra generated by $\Omega$ over $\mathbb{C}$ . Suppose $\mathcal{L} = \mathcal{L}^{*}$ . Then, there is a nonsingular matrix $S$ such that $S \mathcal{L} S^{-1}$ is $BT(n_1, ..., n_t)$ if and only if there is a unitary matrix $U$ such that $U \mathcal{L} U^{*}$ is $BD(n_1, ..., n_t)$ .
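Definitions 2.1 and 2.7 translate directly into zero-pattern tests. The following Python sketch is one possible way to check them for matrices stored as lists of rows; the function names `block_index`, `is_bt` and `is_bd` are illustrative choices, and the test matrices are made up for the example.

```python
def block_index(sizes):
    """Map each row/column index to the number of the diagonal block containing it."""
    return [k for k, n in enumerate(sizes) for _ in range(n)]

def is_bt(A, sizes):
    """True if A is BT(sizes): all entries strictly below the block diagonal vanish."""
    n = len(A)
    if len(sizes) < 2 or sum(sizes) != n:  # the definitions require t >= 2 and n_1+...+n_t = n
        return False
    blk = block_index(sizes)
    return all(A[i][j] == 0 for i in range(n) for j in range(n) if blk[i] > blk[j])

def is_bd(A, sizes):
    """True if A is BD(sizes): all entries outside the diagonal blocks vanish."""
    n = len(A)
    if len(sizes) < 2 or sum(sizes) != n:
        return False
    blk = block_index(sizes)
    return all(A[i][j] == 0 for i in range(n) for j in range(n) if blk[i] != blk[j])

# Made-up examples: M is BT(2, 1) but not BD(2, 1); D is BD(2, 1).
M = [[1, 2, 3],
     [4, 5, 6],
     [0, 0, 7]]
D = [[1, 2, 0],
     [3, 4, 0],
     [0, 0, 5]]
```

A set $\Omega$ is then $BT(n_1, ..., n_t)$ or $BD(n_1, ..., n_t)$ exactly when the corresponding test holds for every matrix in it, for example `all(is_bt(A, sizes) for A in omega)`.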

3. Algorithms for simultaneous block triangularization and block diagonalization of a set of matrices based on the invariant subspaces

Dubi ^[6] gave an algorithmic approach to simultaneous triangularization of a set of $n \times n$ matrices. In this section, we will generalize his study to cover simultaneous block triangularization and simultaneous block diagonalization of a set of $n \times n$ matrices. The generalized algorithms depend on the novel algorithm for constructing invariant subspaces of a set of matrices given in the recent article ^[1] and Theorem 3.3.

Lemma 3.1. Let $\Omega$ be a nonempty set of complex $n \times n$ matrices, $\Omega^{*}$ be the set of the conjugate transpose of each matrix in $\Omega$ and $\mathcal{L}$ be the algebra generated by $\Gamma = \Omega \cup \Omega^{*}$ . Then, $\mathcal{L} = \mathcal{L}^{*}$ .

Proof. Let $A$ be a matrix in $\mathcal{L}$ . Then, $A = P(B_1, ..., B_m)$ for some multivariate noncommutative polynomial $P(x_1, ..., x_m)$ and matrices $\{B_i\}_{i = 1}^m \subseteq \Gamma$ . Therefore, $A^{*} = P^*(B_1, ..., B_m) = Q(B_1^*, ..., B_m^*)$ for some multivariate noncommutative polynomial $Q(x_1, ..., x_m)$ where the matrices $\{B_i^*\}_{i = 1}^m \subseteq \Gamma^* = \Gamma$ . Hence, $A^* \in \mathcal{L}$ for every $A \in \mathcal{L}$ ; since $(A^{*})^{*} = A$ , it follows that $\mathcal{L} = \mathcal{L}^{*}$ .

Lemma 3.2. Let $\Omega$ be a nonempty set of complex $n \times n$ matrices and $\Omega^{*}$ be the set of the conjugate transpose of each matrix in $\Omega$ , and $\Gamma = \Omega \cup \Omega^{*}$ . Then, there is a unitary matrix $U$ such that $U \Gamma U^{*}$ is $BD(n_1, ..., n_t)$ if and only if there is a unitary matrix $U$ such that $U \Omega U^{*}$ is $BD(n_1, ..., n_t)$ .

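Proof. Assume that there exists a unitary matrix $U$ such that $U \Omega U^{*}$ is $BD(n_1, ..., n_t)$ . Then, $(U \Omega U^{*})^{*} = U \Omega^{*} U^{*}$ is $BD(n_1, ..., n_t)$ . Hence, $U \Gamma U^{*}$ is $BD(n_1, ..., n_t)$ . Conversely, since $\Omega \subseteq \Gamma$ , any unitary matrix $U$ for which $U \Gamma U^{*}$ is $BD(n_1, ..., n_t)$ also makes $U \Omega U^{*}$ $BD(n_1, ..., n_t)$ .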

Theorem 3.3. Let $\Omega$ be a nonempty set of complex $n \times n$ matrices and $\Omega^{*}$ be the set of the conjugate transpose of each matrix in $\Omega$ , and $\Gamma = \Omega \cup \Omega^{*}$ . Then, there is a unitary matrix $U$ such that $U \Omega U^{*}$ is $BD(n_1, ..., n_t)$ if and only if there is a unitary matrix $U$ such that $U \Gamma U^{*}$ is $BT(n_1, ..., n_t)$ .

Proof. Let $\mathcal{L}$ be the algebra generated by $\Gamma$ . Then, $\mathcal{L} = \mathcal{L}^{*}$ using Lemma 3.1. Now, by applying Proposition 2.11 and Lemma 3.2, the following statements are equivalent:

There is a unitary matrix $U$ such that $U \Gamma U^{*}$ is $BT(n_1, ..., n_t)$ .

$\iff$ There is a unitary matrix $U$ such that $U \mathcal{L} U^{*}$ is $BT(n_1, ..., n_t)$ .

$\iff$ There is a unitary matrix $U$ such that $U \mathcal{L} U^{*}$ is $BD(n_1, ..., n_t)$ .

$\iff$ There is a unitary matrix $U$ such that $U \Gamma U^{*}$ is $BD(n_1, ..., n_t)$ .

$\iff$ There is a unitary matrix $U$ such that $U \Omega U^{*}$ is $BD(n_1, ..., n_t)$ .

3.1. Algorithm $A$ : Simultaneous block triangularization of a set of $n \times n$ matrices $\{A_i \}_{ i=1}^N$ .

(1) Input: the set $\Omega = \{A_i \}_{ i = 1}^N$ .

(2) Set $k = 0, \mathcal{B} = \phi, s = n, T_i = A_i, S_2 = I$ .

(3) Search for a $d$ -dimensional invariant subspace $V = \langle v_1, v_2, ..., v_d \rangle$ of a set of matrices $\{T_i \}_{ i = 1}^N$ starting from $d = 1$ up to $d = s-1$ . If one does not exist and $k = 0$ , abort and print "no simultaneous block triangularization". Else, if one does not exist and $k\ne 0$ , go to step (8). Else, go to next step.

(4) Set $V_{k+1} = (S_2 v_1\; S_2 v_2\; ...\; S_2 v_d), \mathcal{B} = \mathcal{B} \cup \{S_2 v_1, S_2 v_2, ..., S_2 v_d\}, S_1 = (V_1\; V_2\; ...\; V_{k+1})$ .

(5) Find a basis $\{u_1, u_2, ..., u_l \}$ for the orthogonal complement of $\mathcal{B}$ .

(6) Set $S_2 = (u_1\; u_2\; ...\; u_l), S = (S_1\; S_2)$ , and

$T_i = \left({\begin{array}{*{20}{c}} \textbf{0}_{(s-d)\times d} & \textbf{I}_{s-d} \end{array}} \right)S^{-1}A_i S \left({\begin{array}{*{20}{c}} \textbf{0}_{d \times (s-d)} \\ \textbf{I}_{s-d} \end{array}} \right)$ .

(7) Set $k = k+1, s = s-d$ , and return to step (3).

(8) Compute the QR decomposition of the invertible matrix $S$ , by means of the Gram–Schmidt process, to convert it to a unitary matrix $Q$ .

(9) Output: a unitary matrix $U$ as the conjugate transpose of the resulting matrix $Q$ .

Remark 3.4. If one uses any non-orthogonal complement in step (5) of Algorithm $A$ , then the matrix $S$ is invertible such that $S^{-1} \Omega S$ is $BT(n_1, ..., n_t)$ . However, in such a case, one cannot guarantee that $U \Omega U^{*}$ is $BT(n_1, ..., n_t)$ .
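Steps (8) and (9) of Algorithm $A$ can be sketched as follows. This is a minimal illustration only: it orthonormalizes the columns of an invertible matrix by Gram–Schmidt (in its sequential-projection form) and returns the conjugate transpose, and it does not cover the invariant subspace search of step (3). The helper names and the $3 \times 3$ input are hypothetical.

```python
import math

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

def gram_schmidt_columns(S):
    """Orthonormalize the columns of an invertible matrix S; returns Q with S = Q R."""
    ortho = []
    for v in zip(*S):                      # iterate over the columns of S
        w = [complex(x) for x in v]
        for q in ortho:
            coeff = sum(qi.conjugate() * wi for qi, wi in zip(q, w))  # inner product <q, w>
            w = [wi - coeff * qi for wi, qi in zip(w, q)]
        norm = math.sqrt(sum(abs(x) ** 2 for x in w))
        ortho.append([x / norm for x in w])
    return [list(row) for row in zip(*ortho)]  # store the orthonormal columns row-major

def conjugate_transpose(M):
    """Return M^* (transpose with complex-conjugated entries)."""
    return [[x.conjugate() for x in col] for col in zip(*M)]

# Hypothetical invertible matrix standing in for the matrix S built in steps (4)-(6).
S = [[1, 1, 0],
     [1, 0, 1],
     [0, 1, 1]]

Q = gram_schmidt_columns(S)    # step (8)
U = conjugate_transpose(Q)     # step (9)
P = matmul(U, Q)               # should be the identity up to rounding
```

Because the first $j$ columns of `Q` span the same subspace as the first $j$ columns of `S` for every $j$, the nested subspaces built from the leading columns are unchanged by this step.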

Example 3.5. The set of matrices $\Omega = \{A_i \}_{i = 1}^2$ admits simultaneous block triangularization where

$\begin{equation} A_1 = \left( \begin{array}{cccccc} 3&2&1&0&1&1\\ 0&5&0&0&0&0\\ 0&1&4&0&1&2\\ 1&3&1&1&1&3\\ 0&2&0&0&2&5\\ 0&1&0&0&0&6 \end{array} \right) , A_2 = \left( \begin{array}{cccccc} 44&12&4&-4&8&4\\ 0&36&0&0&0&-1\\ 0&12&32&0&4&4\\ 4&16&8&52&4&4\\ 0&4&-1&0&28&8\\ 0&4&0&0&0 &40\end{array}\right) . \end{equation}$

(3.1)

Applying Algorithm $A$ to the set $\Omega$ can be summarized as follows:

● Input: $\Omega$ .

● Initiation step:

We have $k = 0, \mathcal{B} = \phi, s = 6, T_1 = A_1, T_2 = A_2, S_2 = I$ .

● In the first iteration:

We found a two-dimensional invariant subspace $V = \langle e_1, e_4 \rangle$ of the set of matrices $\{T_i \}_{ i = 1}^2$ . Therefore, $\mathcal{B} = \{e_1, e_4\}, S_1 = (e_1, e_4), S_2 = (e_2, e_3, e_5, e_6)$ ,

$\begin{equation} T_1 = \left( \begin{array}{cccc} 5&0&0&0\\ 1&4&1&2\\ 2&0&2&5\\ 1&0&0&6 \end{array}\right) , T_2 = \left(\begin{array}{cccc} 36&0&0&-1\\ 12&32&4&4\\ 4&-1&28&8\\ 4&0&0&40 \end{array} \right), \end{equation}$

(3.2)

$k = 1$ , and $s = 4$ .

● In the second iteration: We found a two-dimensional invariant subspace $V = \langle e_2, e_3 \rangle$ of the set of matrices $\{T_i \}_{ i = 1}^2$ . Therefore, $\mathcal{B} = \{e_1, e_4, e_3, e_5\}, S_1 = (e_1, e_4, e_3, e_5), S_2 = (e_2, e_6)$ ,

$\begin{equation} T_1 = \left( \begin{array}{cc} 5&0\\ 1&6 \end{array} \right) , T_2 = \left(\begin{array}{cc} 36&-1\\ 4&40\end{array}\right), \end{equation}$

(3.3)

$k = 2$ , and $s = 2$ .

● In the third iteration: There is no one-dimensional invariant subspace of a set of matrices $\{T_i \}_{ i = 1}^2$ . Therefore, $S = (e_1\; e_4\; e_3\; e_5\; e_2\; e_6)$ , and the corresponding unitary matrix is

$U = \left(\begin{array}{cccccc} 1&0&0&0&0&0\\ 0&0&0&1&0&0\\ 0&0&1&0&0&0\\ 0&0&0&0&1&0\\ 0&1&0&0&0&0\\ 0&0&0&0&0&1 \end{array} \right)$

such that the set $U \Omega U^{*} = \{U A_i U^{*}\}_{i = 1}^2$ is $BT(2, 2, 2)$ where

$\begin{equation} \begin{array}{l} U A_1 U^{*} = \left( \begin{array}{cc|cc|cc} 3&0&1&1&2&1\\ 1&1&1&1&3&3\\ \hline 0&0&4&1&1&2\\ 0&0&0&2&2&5\\ \hline 0&0&0&0&5&0\\ 0&0&0&0&1&6 \end{array}\right) ,\\ U A_2 U^{*} = \left( \begin{array}{cc|cc|cc} 44&-4&4&8&12&4\\ 4&52&8&4&16&4\\ \hline 0&0&32&4&12&4\\ 0&0&-1&28&4&8\\ \hline 0&0&0&0&36&-1\\ 0&0&0&0&4&40 \end{array}\right) .\\ \end{array} \end{equation}$

(3.4)
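Since the matrix $U$ of Example 3.5 is a permutation matrix, the result (3.4) can be reproduced by reordering rows and columns. The sketch below is a verification aid under that observation; the helper names `permute`, `block_index` and `is_bt` are illustrative.

```python
A1 = [[3, 2, 1, 0, 1, 1],
      [0, 5, 0, 0, 0, 0],
      [0, 1, 4, 0, 1, 2],
      [1, 3, 1, 1, 1, 3],
      [0, 2, 0, 0, 2, 5],
      [0, 1, 0, 0, 0, 6]]
A2 = [[44, 12, 4, -4, 8, 4],
      [0, 36, 0, 0, 0, -1],
      [0, 12, 32, 0, 4, 4],
      [4, 16, 8, 52, 4, 4],
      [0, 4, -1, 0, 28, 8],
      [0, 4, 0, 0, 0, 40]]

order = [1, 4, 3, 5, 2, 6]  # S = (e1 e4 e3 e5 e2 e6); row i of U is e_{order[i]}

def permute(A, order):
    """Return U A U^* for the permutation matrix U whose i-th row is e_{order[i]}."""
    idx = [k - 1 for k in order]
    return [[A[i][j] for j in idx] for i in idx]

def block_index(sizes):
    """Map each index to the number of the diagonal block containing it."""
    return [k for k, n in enumerate(sizes) for _ in range(n)]

def is_bt(A, sizes):
    """True if all entries strictly below the block diagonal are zero."""
    blk = block_index(sizes)
    n = len(A)
    return all(A[i][j] == 0 for i in range(n) for j in range(n) if blk[i] > blk[j])

P1 = permute(A1, order)
P2 = permute(A2, order)
both_bt = is_bt(P1, (2, 2, 2)) and is_bt(P2, (2, 2, 2))
```

In the original ordering of the basis, neither matrix is $BT(2, 2, 2)$, which can be confirmed with `is_bt(A1, (2, 2, 2))`.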

3.2. Algorithm $B$ : Simultaneous block diagonalization of a set of $n \times n$ matrices $\{A_i \}_{ i=1}^N$ .

(1) Input: the set $\Omega = \{A_i \}_{ i = 1}^N$ .

(2) Construct the set $\Gamma = \Omega \cup \Omega^{*}$ .

(3) Find a unitary matrix $U$ such that $U \Gamma U^{*}$ is $BT(n_1, ..., n_t)$ using Algorithm $A$ .

(4) Output: a unitary matrix $U$ .

Remark 3.6. Algorithm $B$ provides the finest block-diagonalization. Moreover, the number of the blocks equals the number of the invariant subspaces, and the size of each block is $n_i \times n_i$ , where $n_i$ is the dimension of the invariant subspace.

Example 3.7. The set of matrices $\Omega = \{A_i \}_{i = 1}^2$ admits simultaneous block diagonalization where

$\begin{equation} A_1 = \left( \begin{array}{ccccccc} 3&0&0&0&0&0&0\\ 0&2&0&0&0&0&0\\ 0&0&2&0&0&0&0\\ 0&0&0&1&0&0&0\\ 0&0&0&0&1&0&0\\ 0&0&0&0&0&1&0\\ 0&0&0&0&0&0&3\end{array} \right) , A_2 = \left( \begin{array}{ccccccc} 0&0&0&0&0&0&0\\ 0&0&0&0&0&0&0\\ 0&1&0&0&0&0&0\\ 0&0&0&0&0&0&0\\ 0&0&0&0&0&0&0\\ 0&0&0&1&0&0&0\\ 1&0&0&0&0&0&0\end{array} \right) . \end{equation}$

(3.5)

Applying Algorithm $B$ to the set $\Omega$ can be summarized as follows:

● Input: $\Gamma = \Omega \cup \Omega^{*}$ .

● Initiation step:

We have $k = 0, \mathcal{B} = \phi, s = 7, T_1 = A_1, T_2 = A_2, T_3 = A_2^T, S_2 = I$ .

● In the first iteration:

We found a one-dimensional invariant subspace $V = \langle e_5 \rangle$ of the set of matrices $\{T_i \}_{ i = 1}^3$ . Therefore, $\mathcal{B} = \{e_5\}, S_1 = (e_5), S_2 = (e_1, e_2, e_3, e_4, e_6, e_7)$ ,

$\begin{equation} T_1 = \left( \begin{array}{cccccc} 3&0&0&0&0&0\\ 0&2&0&0&0&0\\ 0&0&2&0&0&0\\ 0&0&0&1&0&0\\ 0&0&0&0&1&0\\ 0&0&0&0&0&3 \end{array} \right) , T_2 = \left( \begin{array}{cccccc} 0&0&0&0&0&0\\ 0&0&0&0&0&0\\ 0&1&0&0&0&0\\ 0&0&0&0&0&0\\ 0&0&0&1&0&0\\ 1&0&0&0&0&0 \end{array} \right), T_3 = T_2^T, \end{equation}$

(3.6)

$k = 1$ , and $s = 6$ .

● In the second iteration: We found a two-dimensional invariant subspace $V = \langle e_4, e_5 \rangle$ of the set of matrices $\{T_i \}_{ i = 1}^3$ . Therefore, $\mathcal{B} = \{e_5, e_4, e_6\}, S_1 = (e_5\; e_4\; e_6), S_2 = (e_1, e_2, e_3, e_7)$ ,

$\begin{equation} T_1 = \left( \begin{array}{cccc} 3&0&0&0\\ 0&2&0&0\\ 0&0&2&0\\ 0&0&0&3 \end{array} \right) , T_2 = \left( \begin{array}{cccc} 0&0&0&0\\ 0&0&0&0\\ 0&1&0&0\\ 1&0&0&0 \end{array} \right), T_3 = T_2^T, \end{equation}$

(3.7)

$k = 2$ , and $s = 4$ .

● In the third iteration: We found a two-dimensional invariant subspace $V = \langle e_2, e_3 \rangle$ of the set of matrices $\{T_i \}_{ i = 1}^3$ . Therefore, $\mathcal{B} = \{e_5, e_4, e_6, e_2, e_3\}, S_1 = (e_5\; e_4\; e_6\; e_2\; e_3), S_2 = (e_1, e_7)$ ,

$\begin{equation} T_1 = \left( \begin{array}{cc} 3&0\\ 0&3 \end{array} \right) , T_2 = \left( \begin{array}{cc} 0&0\\ 1&0 \end{array} \right), T_3 = \left( \begin{array}{cc} 0&1\\ 0&0 \end{array} \right), \end{equation}$

(3.8)

$k = 3$ , and $s = 2$ .

● In the fourth iteration: There is no one-dimensional invariant subspace of a set of matrices $\{T_i \}_{ i = 1}^3$ . Therefore, $S = (e_5\; e_4\; e_6\; e_2\; e_3\; e_1\; e_7)$ , and the corresponding unitary matrix is

$U = \left( \begin{array}{ccccccc} 0&0&0&0&1&0&0\\ 0&0&0&1&0&0&0\\ 0&0&0&0&0&1&0\\ 0&1&0&0&0&0&0\\ 0&0&1&0&0&0&0\\ 1&0&0&0&0&0&0\\ 0&0&0&0&0&0&1 \end{array} \right)$

such that the set $U \Omega U^{*} = \{U A_i U^{*}\}_{i = 1}^2$ is $BD(1, 2, 2, 2)$ where

$\begin{equation} \begin{array}{l} U A_1 U^{*} = \left( \begin{array}{c} 1\end{array} \right)\oplus \left( \begin{array}{cc} 1&0\\ 0&1\end{array} \right) \oplus \left( \begin{array}{cc} 2&0\\ 0&2\end{array} \right) \oplus \left( \begin{array}{cc} 3&0\\ 0&3\end{array} \right) ,\\ U A_2 U^{*} = \left( \begin{array}{c} 0\end{array} \right)\oplus \left( \begin{array}{cc} 0&0\\ 1&0\end{array} \right) \oplus \left( \begin{array}{cc} 0&0\\ 1&0\end{array} \right) \oplus \left( \begin{array}{cc} 0&0\\ 1&0\end{array} \right) .\\ \end{array} \end{equation}$

(3.9)
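Example 3.7 also illustrates Theorem 3.3: once the set $\Gamma = \Omega \cup \Omega^{*}$ is block triangular in the new basis, the set $\Omega$ is block diagonal there. The following sketch checks this for the permutation found above; the helper names (`from_entries`, `permute`, `is_bt`, `is_bd`) are hypothetical, and the matrices are real, so the conjugate transpose reduces to the transpose.

```python
def from_entries(n, entries):
    """Build an n x n integer matrix from {(row, col): value} with 1-based indices."""
    M = [[0] * n for _ in range(n)]
    for (i, j), v in entries.items():
        M[i - 1][j - 1] = v
    return M

def transpose(M):
    return [list(col) for col in zip(*M)]

def permute(A, order):
    """Return U A U^* for the permutation matrix U whose i-th row is e_{order[i]}."""
    idx = [k - 1 for k in order]
    return [[A[i][j] for j in idx] for i in idx]

def block_index(sizes):
    return [k for k, n in enumerate(sizes) for _ in range(n)]

def is_bt(A, sizes):
    blk, n = block_index(sizes), len(A)
    return all(A[i][j] == 0 for i in range(n) for j in range(n) if blk[i] > blk[j])

def is_bd(A, sizes):
    blk, n = block_index(sizes), len(A)
    return all(A[i][j] == 0 for i in range(n) for j in range(n) if blk[i] != blk[j])

# Matrices of (3.5): A1 is diagonal, A2 has three nonzero entries.
A1 = from_entries(7, {(k, k): d for k, d in enumerate([3, 2, 2, 1, 1, 1, 3], start=1)})
A2 = from_entries(7, {(3, 2): 1, (6, 4): 1, (7, 1): 1})

gamma = [A1, A2, transpose(A2)]   # A1 is real symmetric, so A1^* = A1
order = [5, 4, 6, 2, 3, 1, 7]     # S = (e5 e4 e6 e2 e3 e1 e7)
sizes = (1, 2, 2, 2)

gamma_is_bt = all(is_bt(permute(M, order), sizes) for M in gamma)
omega_is_bd = all(is_bd(permute(M, order), sizes) for M in (A1, A2))
P2 = permute(A2, order)           # second matrix in (3.9)
```

The reason is elementary: if both a matrix and its conjugate transpose are block upper triangular with the same block sizes, all off-diagonal blocks must vanish.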

Example 3.8. The set of matrices $\Omega = \{A_i \}_{i = 1}^2$ admits simultaneous block diagonalization where

$\begin{equation} A_1 = \left( \begin{array}{ccccccc} 3&0&0&0&0&0&0\\ 0&2&0&0&0&0&0\\ 0&0&2&0&0&0&0\\ 0&0&0&1&0&0&0\\ 0&0&0&0&1&0&0\\ 0&0&0&0&0&1&0\\ 0&0&0&0&0&0&3 \end{array} \right) , A_2 = \left( \begin{array}{ccccccc} 0&0&0&0&0&0&0\\ 0&0&0&1&0&0&0\\ 0&1&0&0&0&0&0\\ 0&0&0&0&0&0&0\\ 0&0&0&0&1&0&0\\ 0&0&0&0&1&0&0\\ 1&0&0&0&0&0&0\end{array} \right) . \end{equation}$

(3.10)

Similarly, applying Algorithm $B$ to the set $\Omega$ provides the matrix $S = (e_6\; e_5\; e_7\; e_1\; e_3\; e_2\; e_4)$ . Therefore, the corresponding unitary matrix is

$U = \left( \begin{array}{ccccccc} 0&0&0&0&0&1&0\\ 0&0&0&0&1&0&0\\ 0&0&0&0&0&0&1\\ 1&0&0&0&0&0&0\\ 0&0&1&0&0&0&0\\ 0&1&0&0&0&0&0\\ 0&0&0&1&0&0&0 \end{array} \right)$

such that the set $U \Omega U^{*} = \{U A_i U^{*}\}_{i = 1}^2$ is $BD(2, 2, 3)$ where

$\begin{equation} \begin{array}{l} U A_1 U^{*} = \left( \begin{array}{cc} 1&0\\ 0&1\end{array}\right) \oplus \left( \begin{array}{cc} 3&0\\ 0&3\end{array}\right)\oplus \left( \begin{array}{ccc} 2&0&0\\ 0&2&0\\ 0&0&1\end{array} \right) ,\\ U A_2 U^{*} = \left( \begin{array}{cc} 0&1\\ 0&1\end{array}\right) \oplus \left( \begin{array}{cc} 0&1\\ 0&0\end{array}\right)\oplus \left( \begin{array}{ccc} 0&1&0\\ 0&0&1\\ 0&0&0\end{array} \right) .\\ \end{array} \end{equation}$

(3.11)

Example 3.9. The set of matrices $\Omega = \{A_i \}_{i = 1}^3$ admits simultaneous block diagonalization where

$\begin{equation} \begin{array}{ll} A_1 = \left( \begin{array}{ccccccccc} 0&0&0&0&0&0&0&0&0\\ 0&2&0&0&0&0&0&0&0\\ 0&0&1&0&0&0&0&0&0\\ 0&0&0&-2&0&0&0&0&0\\ 0&0&0&0&0&0&0&0&0\\ 0&0&0&0&0&-1&0&0&0\\ 0&0&0&0&0&0&-1&0&0\\ 0&0&0&0&0&0&0&1&0\\ 0&0&0&0&0&0&0&0&0 \end{array} \right) , A_2 = \left( \begin{array}{ccccccccc} 0&0&0&1&0&0&0&0&0\\ -1&0&0&0&1&0&0&0&0\\ 0&0&0&0&0&1&0&0&0\\ 0&0&0&0&0&0&0&0&0\\ 0&0&0&-1&0&0&0&0&0\\ 0&0&0&0&0&0&0&0&0\\ 0&0&0&0&0&0&0&0&0\\ 0&0&0&0&0&0&-1&0&0\\ 0&0&0&0&0&0&0&0&0 \end{array} \right),\\ A_3 = \left( \begin{array}{ccccccccc} 0&-1&0&0&0&0&0&0&0\\ 0&0&0&0&0&0&0&0&0\\ 0&0&0&0&0&0&0&0&0\\ 1&0&0&0&-1&0&0&0&0\\ 0&1&0&0&0&0&0&0&0\\ 0&0&1&0&0&0&0&0&0\\ 0&0&0&0&0&0&0&-1&0\\ 0&0&0&0&0&0&0&0&0\\ 0&0&0&0&0&0&0&0&0 \end{array} \right). \end{array} \end{equation}$

(3.12)

Similarly, applying Algorithm $B$ to the set $\Omega$ provides the matrix $S = (e_1+e_5\; e_9\; e_3\; e_6\; e_8\; -e_7\; e_1-e_5\; e_2\; e_4)$ . Therefore, the corresponding unitary matrix is

$U = \left( \begin{array}{ccccccccc} \frac{1}{\sqrt{2}}&0&0&0& \frac{1}{\sqrt{2}}&0&0&0&0\\ 0&0&0&0&0&0&0&0&1\\ 0&0&1&0&0&0&0&0&0\\ 0&0&0&0&0&1&0&0&0\\ 0&0&0&0&0&0&0&1&0\\ 0&0&0&0&0&0&-1&0&0\\ \frac{1}{\sqrt{2}}&0&0&0&- \frac{1}{\sqrt{2}}&0&0&0&0\\ 0&1&0&0&0&0&0&0&0\\ 0&0&0&1&0&0&0&0&0 \end{array} \right)$

such that the set $U \Omega U^{*} = \{U A_i U^{*}\}_{i = 1}^3$ is $BD(1, 1, 2, 2, 3)$ where

$\begin{equation} \begin{array}{l} U A_1 U^{*} = \left( \begin{array}{c} 0\end{array} \right)\oplus \left( \begin{array}{c} 0\end{array} \right) \oplus \left( \begin{array}{cc} 1&0\\ 0&-1\end{array}\right) \oplus \left( \begin{array}{cc} 1&0\\ 0&-1\end{array}\right)\oplus \left( \begin{array}{ccc} 0&0&0\\ 0&2&0\\ 0&0&-2\end{array} \right) ,\\ U A_2 U^{*} = \left( \begin{array}{c} 0\end{array} \right)\oplus \left( \begin{array}{c} 0\end{array} \right) \oplus \left( \begin{array}{cc} 0&1\\ 0&0\end{array}\right) \oplus \left( \begin{array}{cc} 0&1\\ 0&0\end{array}\right) \oplus \left( \begin{array}{ccc} 0&0&\sqrt{2}\\ -\sqrt{2}&0&0\\ 0&0&0\end{array} \right) ,\\ U A_3 U^{*} = \left( \begin{array}{c} 0\end{array} \right)\oplus \left( \begin{array}{c} 0\end{array} \right) \oplus \left( \begin{array}{cc} 0&0\\ 1&0\end{array} \right) \oplus \left( \begin{array}{cc} 0&0\\ 1&0\end{array} \right) \oplus \left( \begin{array}{ccc} 0&-\sqrt{2}&0\\ 0&0&0\\ \sqrt{2}&0&0\end{array} \right) .\\ \end{array} \end{equation}$

(3.13)
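The block structure in (3.13) can be checked numerically. The following is a minimal sketch, not the authors' Maple code: it assumes NumPy, enters $A_1, A_2, A_3$ from (3.12) through a hypothetical helper `from_entries`, and takes the entries of $U$ in rows 1 and 7 to be $\pm 1/\sqrt{2}$, which is what normalizing the columns $e_1 \pm e_5$ of $S$ gives.

```python
import numpy as np

def from_entries(n, entries):
    """Build an n x n matrix from {(row, col): value} with 1-based indices."""
    M = np.zeros((n, n))
    for (i, j), v in entries.items():
        M[i - 1, j - 1] = v
    return M

# Matrices A_1, A_2, A_3 as given in (3.12).
A1 = np.diag([0., 2., 1., -2., 0., -1., -1., 1., 0.])
A2 = from_entries(9, {(1, 4): 1, (2, 1): -1, (2, 5): 1,
                      (3, 6): 1, (5, 4): -1, (8, 7): -1})
A3 = from_entries(9, {(1, 2): -1, (4, 1): 1, (4, 5): -1,
                      (5, 2): 1, (6, 3): 1, (7, 8): -1})

# Rows of U are the normalized columns of S (all entries are real here).
r = 1 / np.sqrt(2)
U = from_entries(9, {(1, 1): r, (1, 5): r, (2, 9): 1, (3, 3): 1,
                     (4, 6): 1, (5, 8): 1, (6, 7): -1,
                     (7, 1): r, (7, 5): -r, (8, 2): 1, (9, 4): 1})

# Boolean mask selecting every entry outside the BD(1, 1, 2, 2, 3) blocks.
sizes = (1, 1, 2, 2, 3)
mask = np.ones((9, 9), dtype=bool)
start = 0
for m in sizes:
    mask[start:start + m, start:start + m] = False
    start += m

is_unitary = np.allclose(U @ U.T, np.eye(9))
off_block = max(np.abs((U @ A @ U.T)[mask]).max() for A in (A1, A2, A3))
last_block_A2 = (U @ A2 @ U.T)[6:, 6:]

print(is_unitary, off_block)
```

Here `off_block` is the largest entry in absolute value lying outside the five diagonal blocks, so a value at rounding level confirms the claimed structure, and `last_block_A2` reproduces the $3 \times 3$ block of $U A_2 U^{*}$.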

4. Algorithm for simultaneous block diagonalization of a set of matrices based on a commuting matrix

This section focuses on an alternative approach to simultaneous block diagonalization of a set of $n \times n$ matrices $\{A_s\}_{ s = 1}^N$ by an invertible matrix. Unlike Algorithm $B$ in the previous section, it does not require finding the common invariant subspaces. Maehara et al. ^[9] introduced an algorithm for simultaneous block diagonalization of a set of matrices by a unitary matrix based on the eigenvalue decomposition of a Hermitian commuting matrix. Here, we extend their algorithm to a non-Hermitian commuting matrix by considering its generalized eigenvectors. Moreover, a new characterization is presented: we prove that the existence of a commuting matrix with at least two distinct eigenvalues is a necessary and sufficient condition for simultaneous block diagonalization by an invertible matrix.

Proposition 4.1. Let $V$ be a finite-dimensional vector space over $\mathbb{C}$, and let $T:V \rightarrow V$ be a linear operator. Let $\lambda_1, ..., \lambda_k$ be the distinct eigenvalues of $T$ . Then, each generalized eigenspace $G_{\lambda_i}(T)$ is $T$ -invariant, and we have the direct sum decomposition

$V = G_{\lambda_1}(T)\oplus G_{\lambda_2}(T) \oplus ...\oplus G_{\lambda_k}(T).$

Lemma 4.2. Let $V$ be a vector space, and let $T:V \rightarrow V$ , $L:V \rightarrow V$ be linear commuting operators. Let $\lambda_1, ..., \lambda_k$ be distinct eigenvalues of $T$ . Then, each generalized eigenspace $G_{\lambda_i}(T)$ is $L$ -invariant.

Proof. Let $V$ be a vector space and $\lambda_1, ..., \lambda_k$ be distinct eigenvalues of $T$ with the minimal polynomial $\mu(x) = (x-\lambda_1)^{n_1}(x-\lambda_2)^{n_2}...(x-\lambda_k)^{n_k}$ . Then, we have the direct sum decomposition $V = G_{\lambda_1}(T)\oplus G_{\lambda_2}(T) \oplus...\oplus G_{\lambda_k}(T)$ .

For each $i = 1, ..., k$ , let $x \in G_{\lambda_i}(T)$ , so that $(T-\lambda_i I)^{n_i}x = 0$ . Since $L$ commutes with $T$ , it commutes with every polynomial in $T$ , and therefore $(T-\lambda_i I)^{n_i}Lx = L(T-\lambda_i I)^{n_i}x = 0$ . Hence, $L x \in G_{\lambda_i}(T)$ .

Theorem 4.3. Let $\{A_s\}_{ s = 1}^N$ be a set of $n \times n$ matrices. Then, the set $\{A_s\}_{ s = 1}^N$ admits simultaneous block diagonalization by an invertible matrix $S$ if and only if the set $\{A_s\}_{ s = 1}^N$ commutes with a matrix $C$ that possesses at least two distinct eigenvalues.

Proof. $\Rightarrow$ Assume that the set $\{A_s\}_{ s = 1}^N$ admits simultaneous block diagonalization by an invertible matrix $S$ such that

$S^{-1} A_s S = B_{s,1} \oplus B_{s,2} \oplus ... \oplus B_{s,k},$

where the number of blocks $k\geq 2$ , and the matrices $B_{s, 1}, B_{s, 2}, ..., B_{s, k}$ have sizes $n_1 \times n_1, n_2 \times n_2, ..., n_k \times n_k$ , respectively, for all $s = 1, .., N$ .

Now, define the matrix $C$ as

$C = S (\lambda_1 I_{n_1 \times n_1} \oplus \lambda_2 I_{n_2 \times n_2} \oplus ... \oplus \lambda_k I_{n_k \times n_k}) S^{-1},$

where $\lambda_1, \lambda_2, ..., \lambda_k$ are any distinct numbers.

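Since $S^{-1} C S$ acts as the scalar $\lambda_i$ on the $i$ -th block, it commutes with every block diagonal matrix $S^{-1} A_s S$ ; hence, the matrix $C$ commutes with the set $\{A_s\}_{ s = 1}^N$ . Moreover, it has the distinct eigenvalues $\lambda_1, \lambda_2, ..., \lambda_k$ .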
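The construction of $C$ in this direction of the proof can be illustrated numerically. The sketch below is only an illustration under stated assumptions: it uses NumPy, a hypothetical invertible $S$ (upper triangular with ones), randomly generated blocks $B_{s,i}$ of sizes $2$ and $3$, and the arbitrary choice $\lambda_1 = 1$, $\lambda_2 = -1$. None of these values come from the paper.

```python
import numpy as np

def direct_sum(blocks):
    """Block diagonal matrix B_1 (+) B_2 (+) ... (+) B_k."""
    n = sum(b.shape[0] for b in blocks)
    M = np.zeros((n, n))
    start = 0
    for b in blocks:
        m = b.shape[0]
        M[start:start + m, start:start + m] = b
        start += m
    return M

def commuting_matrix(S, sizes, lams):
    """C = S (lam_1 I (+) ... (+) lam_k I) S^{-1} for block sizes n_1, ..., n_k."""
    d = np.repeat(lams, sizes)  # lam_i repeated n_i times
    return S @ np.diag(d) @ np.linalg.inv(S)

sizes, lams = [2, 3], [1.0, -1.0]   # illustrative block sizes and eigenvalues
n = sum(sizes)
S = np.triu(np.ones((n, n)))        # hypothetical invertible matrix (det = 1)
S_inv = np.linalg.inv(S)

# Three matrices that S block-diagonalizes by construction.
rng = np.random.default_rng(0)
mats = [S @ direct_sum([rng.standard_normal((m, m)) for m in sizes]) @ S_inv
        for _ in range(3)]

C = commuting_matrix(S, sizes, lams)
commutes = all(np.allclose(C @ A, A @ C) for A in mats)
print(commutes)
```

Any other choice of distinct $\lambda_i$ works equally well. The only property used is that $S^{-1} C S$ is a scalar multiple of the identity on each block.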

$\Leftarrow$ Assume that the set $\{A_s\}_{ s = 1}^N$ commutes with a matrix $C$ that possesses the distinct eigenvalues $\lambda_1, \lambda_2, ..., \lambda_k$ with $k \geq 2$ .

Using Proposition 4.1, one can use the generalized eigenspaces $G_{\lambda_i}(C)$ of the matrix $C$ associated with these distinct eigenvalues to decompose the matrix $C$ as a direct sum of $k$ matrices. This can be achieved by restricting the matrix $C$ to the invariant subspaces $G_{\lambda_i}(C)$ as follows:

$S^{-1}{C}S = {\big[ C \big]}_{G_{\lambda_1}(C)} \oplus {\big[ C \big]}_{G_{\lambda_2}(C)} \oplus ... \oplus {\big[ C \big]}_{G_{\lambda_k}(C)}$

where the columns of $S$ are formed from bases of the generalized eigenspaces, that is,

$S = \big( G_{\lambda_1}(C), G_{\lambda_2}(C) ,...,G_{\lambda_k}(C) \big).$

Using Lemma 4.2, one can restrict each matrix $A_s$ on the invariant subspaces $G_{\lambda_i}(C)$ to decompose the matrix $A_s$ as a direct sum of $k$ matrices as follows:

$S^{-1}{A_s}S = {\big[ A_s \big]}_{G_{\lambda_1}(C)} \oplus {\big[ A_s \big]}_{G_{\lambda_2}(C)} \oplus ... \oplus {\big[ A_s \big]}_{G_{\lambda_k}(C)}.$

Remark 4.4. For a given set of $n \times n$ matrices $\{A_s\}_{ s = 1}^N$ , if the set $\{A_s\}_{ s = 1}^N$ commutes only with matrices that have a single eigenvalue, then it does not admit a simultaneous block diagonalization by an invertible matrix.

Algorithm $C$ :

(1) Input: the set $\Omega = \{A_s \}_{ s = 1}^N$ .

(2) Construct the following matrix:

$\begin{equation*} X = \left( \begin{array}{c} I \otimes A_1 -A_1^T \otimes I \\ I \otimes A_2 -A_2^T \otimes I \\ \vdots \\ I \otimes A_N -A_N^T \otimes I \\ \end{array} \right) . \end{equation*}$

(3) Compute the null space of the matrix $X$ and reshape the obtained vectors as $n \times n$ matrices. These matrices commute with all the matrices $\{A_s \}_{ s = 1}^N$ .

(4) Choose a matrix $C$ from the obtained matrices that possesses at least two distinct eigenvalues.

(5) Find the distinct eigenvalues $\lambda_1, ..., \lambda_k$ of the matrix $C$ and the corresponding algebraic multiplicities $n_1, n_2, ..., n_k$ .

(6) Find each generalized eigenspace $G_{\lambda_i}(C)$ of the matrix $C$ associated to the eigenvalue $\lambda_i$ by computing the null space of $(C-\lambda_i I)^{n_i}$ .

(7) Construct the invertible matrix $S$ as

$S = \big( G_{\lambda_1}(C), G_{\lambda_2}(C) ,...,G_{\lambda_k}(C) \big).$

(8) Verify that

$S^{-1} A_s S = B_{s,1} \oplus B_{s,2} \oplus ... \oplus B_{s,k},$

where the matrices $B_{s, 1}, B_{s, 2}, ..., B_{s, k}$ have sizes $n_1 \times n_1, n_2 \times n_2, ..., n_k \times n_k$ , respectively, for all $s = 1, .., N$ .

(9) Output: an invertible matrix $S$ .
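The steps of Algorithm $C$ can be sketched in floating-point arithmetic as follows. This is a hedged illustration, not the authors' Maple procedures. It assumes NumPy. The function names (`commuting_basis`, `distinct_eigenvalues`, `algorithm_c`, `off_block_norm`), the tolerances, and the rule for choosing $C$ (only the null-space basis matrices are tried, preferring the one with the most, best separated, eigenvalues) are all assumptions of this sketch. Step (2) relies on the identity $\mathrm{vec}(A C - C A) = (I \otimes A - A^T \otimes I)\,\mathrm{vec}(C)$ for the column-stacking $\mathrm{vec}$. The demo matrices at the end are hypothetical.

```python
import numpy as np

def commuting_basis(mats, tol=1e-10):
    """Steps 2-3: basis of {C : A_s C = C A_s for all s} from the null space of X."""
    n = mats[0].shape[0]
    I = np.eye(n)
    X = np.vstack([np.kron(I, A) - np.kron(A.T, I) for A in mats])
    _, s, vh = np.linalg.svd(X)
    rank = int(np.sum(s > tol * max(s[0], 1.0)))
    # Rows vh[rank:] span the null space; column-major reshape undoes vec(C).
    return [v.conj().reshape(n, n, order="F") for v in vh[rank:]]

def distinct_eigenvalues(C, tol=1e-6):
    """Step 5: list of (eigenvalue, algebraic multiplicity), clustered up to tol."""
    clusters = []
    for lam in np.linalg.eigvals(C):
        for c in clusters:
            if abs(lam - c[0]) < tol:
                c.append(lam)
                break
        else:
            clusters.append([lam])
    return [(np.mean(c), len(c)) for c in clusters]

def algorithm_c(mats, tol=1e-6):
    """Steps 4-7: return (S, block sizes), or None if no suitable C is found."""
    n = mats[0].shape[0]
    best_key, best = (0, 0.0), None
    for C in commuting_basis(mats):
        eig = distinct_eigenvalues(C, tol)
        lams = [lam for lam, _ in eig]
        gap = min((abs(a - b) for i, a in enumerate(lams) for b in lams[i + 1:]),
                  default=0.0)
        if (len(eig), gap) > best_key:
            best_key, best = (len(eig), gap), (C, eig)
    if best is None or best_key[0] < 2:
        return None
    C, eig = best
    cols = []
    for lam, m in eig:
        # Step 6: G_lam(C) = null space of (C - lam I)^m, which has dimension m,
        # so the last m right singular vectors form a basis.
        M = np.linalg.matrix_power(C - lam * np.eye(n), m)
        _, _, vh = np.linalg.svd(M)
        cols.append(vh[n - m:].conj().T)
    return np.hstack(cols), [m for _, m in eig]

def off_block_norm(M, sizes):
    """Step 8: largest |entry| of M outside the diagonal blocks."""
    mask = np.ones(M.shape, dtype=bool)
    start = 0
    for m in sizes:
        mask[start:start + m, start:start + m] = False
        start += m
    return np.abs(M[mask]).max() if mask.any() else 0.0

# Hypothetical demo: two 3 x 3 matrices with a hidden 1 + 2 block structure.
T = np.array([[1., 1., 0.], [0., 1., 1.], [1., 0., 2.]])
B1 = np.array([[1., 0., 0.], [0., 0., 1.], [0., 0., 0.]])
B2 = np.array([[2., 0., 0.], [0., 0., 0.], [0., 1., 0.]])
mats = [T @ B @ np.linalg.inv(T) for B in (B1, B2)]

S, sizes = algorithm_c(mats)
residual = max(off_block_norm(np.linalg.solve(S, A @ S), sizes) for A in mats)
print(sizes, residual)
```

Because only the basis matrices of the commutant are tried, this sketch does not necessarily find the matrix $C$ with the maximum number of distinct eigenvalues. Trying linear combinations of the basis matrices is one possible refinement. Exact (symbolic or rational) arithmetic avoids the tolerance choices altogether.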

Remark 4.5. Algorithm $C$ provides the finest block diagonalization if one chooses a matrix $C$ with the maximum number of distinct eigenvalues. Moreover, the number of blocks equals the number of distinct eigenvalues, and the size of each block is $n_i \times n_i$ , where $n_i$ is the algebraic multiplicity of the eigenvalue $\lambda_i$ .

Example 4.6. Consider the set of matrices $\Omega = \{A_i \}_{i = 1}^6$ where

$\begin{equation} \begin{array}{l} A_1 = \left( \begin{array}{cccccc} 0&0&0&0&0&0\\ 0&0&0&1&0&0\\ 0&0&0&0&1&0\\ 0&-1&0&0&0&0\\ 0&0&-1&0&0&0\\ 0&0&0&0&0&0 \end{array} \right) , A_2 = \left( \begin{array}{cccccc} 0&0&0&-1&0&0\\ 0&0&0&0&0&0\\ 0&0&0&0&0&1\\ 1&0&0&0&0&0\\ 0&0&0&0&0&0\\ 0&0&-1&0&0&0 \end{array} \right), A_3 = \left( \begin{array}{cccccc} 0&0&0&0&-1&0\\ 0&0&0&0&0&-1\\ 0&0&0&0&0&0\\ 0&0&0&0&0&0\\ 1&0&0&0&0&0\\ 0&1&0&0&0&0 \end{array} \right) , \\ A_4 = \left( \begin{array}{cccccc} 0&1&0&0&0&0\\ -1&0&0&0&0&0\\ 0&0&0&0&0&0\\ 0&0&0&0&0&0\\ 0&0&0&0&0&1\\ 0&0&0&0&-1&0 \end{array} \right), A_5 = \left( \begin{array}{cccccc} 0&0&1&0&0&0\\ 0&0&0&0&0&0\\ -1&0&0&0&0&0\\ 0&0&0&0&0&-1\\ 0&0&0&0&0&0\\ 0&0&0&1&0&0 \end{array} \right), A_6 = \left( \begin{array}{cccccc} 0&0&0&0&0&0\\ 0&0&1&0&0&0\\ 0&-1&0&0&0&0\\ 0&0&0&0&1&0\\ 0&0&0&-1&0&0\\ 0&0&0&0&0&0 \end{array} \right). \end{array} \end{equation}$

(4.1)

The set $\Omega$ admits simultaneous block diagonalization by an invertible matrix. An invertible matrix can be obtained by applying Algorithm $C$ to the set $\Omega$ as summarized below:

● A matrix $C$ that commutes with all the matrices $\{A_i \}_{ i = 1}^6$ can be obtained as

$\begin{equation} C = \left( \begin{array}{cccccc} 0&0&0&0&0&1\\ 0&0&0&0&-1&0\\ 0&0&0&1&0&0\\ 0&0&1&0&0&0\\ 0&-1&0&0&0&0\\ 1&0&0&0&0&0 \end{array} \right) . \end{equation}$

(4.2)

● The distinct eigenvalues of the matrix $C$ are $\lambda_1 = -1, \lambda_2 = 1$ with algebraic multiplicities $n_1 = 3, n_2 = 3$ , respectively.

● The generalized eigenspaces of the matrix $C$ associated to the distinct eigenvalues are

$\begin{equation} \begin{array}{l} G_{\lambda_1}(C) = \mathcal{N}(C-\lambda_1 I)^3 = \langle e_6-e_1,e_2+e_5,e_4-e_3\rangle,\\ G_{\lambda_2}(C) = \mathcal{N}(C-\lambda_2 I)^3 = \langle e_1+e_6, e_5-e_2,e_3+e_4 \rangle.\\ \end{array} \end{equation}$

(4.3)

● The invertible matrix $S = \big(G_{\lambda_1}(C), G_{\lambda_2}(C) \big)$ is

$\begin{equation} S = \left( \begin{array}{cccccc} -1&0&0&1&0&0\\ 0&1&0&0&-1&0\\ 0&0&-1&0&0&1\\ 0&0&1&0&0&1\\ 0&1&0&0&1&0\\ 1&0&0&1&0&0 \end{array} \right). \end{equation}$

(4.4)

● The set $S^{-1} \Omega S = \{S^{-1} A_i S\}_{i = 1}^6$ contains block diagonal matrices where

$\begin{equation} \begin{array}{ll} S^{-1} A_1 S = \left( \begin{array}{ccc} 0&0&0\\ 0&0&1\\ 0&-1&0 \end{array} \right) \oplus \left( \begin{array}{ccc} 0&0&0\\ 0&0&-1\\ 0&1&0 \end{array} \right),& S^{-1} A_2 S = \left( \begin{array}{ccc} 0&0&1\\ 0&0&0\\ -1&0&0 \end{array} \right) \oplus \left( \begin{array}{ccc} 0&0&-1\\ 0&0&0\\ 1&0&0 \end{array} \right),\\ S^{-1} A_3 S = \left( \begin{array}{ccc} 0&1&0\\ -1&0&0\\ 0&0&0 \end{array} \right) \oplus \left( \begin{array}{ccc} 0&-1&0\\ 1&0&0\\ 0&0&0 \end{array} \right),& S^{-1} A_4 S = \left( \begin{array}{ccc} 0&-1&0\\ 1&0&0\\ 0&0&0 \end{array} \right) \oplus \left( \begin{array}{ccc} 0&-1&0\\ 1&0&0\\ 0&0&0 \end{array} \right),\\ S^{-1} A_5 S = \left( \begin{array}{ccc} 0&0&1\\ 0&0&0\\ -1&0&0 \end{array} \right) \oplus \left( \begin{array}{ccc} 0&0&1\\ 0&0&0\\ -1&0&0 \end{array} \right),& S^{-1} A_6 S = \left( \begin{array}{ccc} 0&0&0\\ 0&0&-1\\ 0&1&0 \end{array} \right) \oplus \left( \begin{array}{ccc} 0&0&0\\ 0&0&-1\\ 0&1&0 \end{array} \right).\\ \end{array} \end{equation}$

(4.5)
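The computations of Example 4.6 can be verified with a few lines of NumPy. The sketch below is an independent check, not the authors' Maple code: it enters $A_1, \ldots, A_6$ from (4.1), $C$ from (4.2) and $S$ from (4.4) through a hypothetical helper `from_entries`, and then tests the commutation relations and the block structure of (4.5).

```python
import numpy as np

def from_entries(n, entries):
    """Build an n x n matrix from {(row, col): value} with 1-based indices."""
    M = np.zeros((n, n))
    for (i, j), v in entries.items():
        M[i - 1, j - 1] = v
    return M

# The six matrices of (4.1), listed by their nonzero entries.
A = [
    from_entries(6, {(2, 4): 1, (3, 5): 1, (4, 2): -1, (5, 3): -1}),   # A_1
    from_entries(6, {(1, 4): -1, (3, 6): 1, (4, 1): 1, (6, 3): -1}),   # A_2
    from_entries(6, {(1, 5): -1, (2, 6): -1, (5, 1): 1, (6, 2): 1}),   # A_3
    from_entries(6, {(1, 2): 1, (2, 1): -1, (5, 6): 1, (6, 5): -1}),   # A_4
    from_entries(6, {(1, 3): 1, (3, 1): -1, (4, 6): -1, (6, 4): 1}),   # A_5
    from_entries(6, {(2, 3): 1, (3, 2): -1, (4, 5): 1, (5, 4): -1}),   # A_6
]

# The commuting matrix C of (4.2) and the matrix S of (4.4).
C = from_entries(6, {(1, 6): 1, (2, 5): -1, (3, 4): 1,
                     (4, 3): 1, (5, 2): -1, (6, 1): 1})
S = np.array([[-1, 0, 0, 1, 0, 0],
              [0, 1, 0, 0, -1, 0],
              [0, 0, -1, 0, 0, 1],
              [0, 0, 1, 0, 0, 1],
              [0, 1, 0, 0, 1, 0],
              [1, 0, 0, 1, 0, 0]], dtype=float)

commutes = all(np.allclose(C @ Ai, Ai @ C) for Ai in A)
eigenvalues = np.sort(np.linalg.eigvalsh(C))      # C is symmetric
D = [np.linalg.solve(S, Ai @ S) for Ai in A]      # D[i] = S^{-1} A_{i+1} S
block_diagonal = all(np.allclose(Di[:3, 3:], 0) and np.allclose(Di[3:, :3], 0)
                     for Di in D)

print(commutes, block_diagonal)  # prints: True True
```

The diagonal blocks `D[i][:3, :3]` and `D[i][3:, 3:]` can then be compared entry by entry with (4.5).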

5. Conclusions

It is well known that a set of non-defective matrices can be simultaneously diagonalized if and only if the matrices commute. In the case of non-commuting matrices, the best that can be achieved is simultaneous block diagonalization. Both Algorithm B and the Maehara et al. ^[9] algorithm are applicable for simultaneous block diagonalization of a set of matrices by a unitary matrix. Algorithm C can be applied for block diagonalization by an invertible matrix when finding a unitary matrix is not possible. In case block diagonalization of a set of matrices is not possible by a unitary or an invertible matrix, then one may utilize block triangularization by Algorithm A. Algorithms A and B are based on the existence of invariant subspaces; however, Algorithm C is based on the existence of a commuting matrix which is not necessarily Hermitian, unlike the Maehara et al. algorithm.

Use of AI tools declaration

The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

Acknowledgments

Ahmad Y. Al-Dweik and M. T. Mustafa would like to thank Qatar University for its support and excellent research facilities. R. Ghanam and G. Thompson are grateful to VCU Qatar and Qatar Foundation for their support.

Conflict of Interest

The authors declare that they have no conflicts of interest.

Appendix: Maple procedures

Listing 1. Step 5 in Algorithm $A$ .

Listing 2. Step 6 in Algorithm $A$ .

Listing 3. Steps 8 & 9 in Algorithm $A$ .

Listing 4. Steps 2 & 3 in Algorithm $C$ .

Listing 5. Steps 6 & 7 in Algorithm $C$ .

References

[1]	M. Lu, Y. Xu, H. Li, Vehicle Re-Identification based on UAV viewpoint: dataset and method, Remote Sens., 14 (2022), 4603. https://doi.org/10.3390/rs14184603 doi: 10.3390/rs14184603
[2]	S. Ijlil, A. Essahlaoui, M. Mohajane, N. Essahlaoui, E. M. Mili, A. V. Rompaey, Machine learning algorithms for modeling and mapping of groundwater pollution risk: A study to reach water security and sustainable development (SDG) goals in a Mediterranean aquifer system, Remote Sens., 14 (2022), 2379. https://doi.org/10.3390/rs14102379 doi: 10.3390/rs14102379
[3]	Z. Jiang, Z. Song, Y. Bai, X. He, S. Yu, S. Zhang, et al., Remote sensing of global sea surface pH based on massive underway data and machine learning, Remote Sens., 14 (2022), 2366. https://doi.org/10.3390/rs14102366 doi: 10.3390/rs14102366
[4]	Y. Zhao, L. Ge, H. Xie, G. Bai, Z. Zhang, Q. Wei, et al., ASTF: Visual abstractions of time-varying patterns in radio signals, IEEE Trans. Visual Comput. Graphics, 29 (2023), 214–224. https://doi.org/10.1109/TVCG.2022.3209469 doi: 10.1109/TVCG.2022.3209469
[5]	R. Girshick, J. Donahue, T. Darrell, J. Malik, Rich feature hierarchies for accurate object detection and semantic segmentation, in 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2014), 580–587. https://doi.org/10.1109/CVPR.2014.81
[6]	R. Girshick, Fast R-CNN, in 2015 IEEE International Conference on Computer Vision (ICCV), (2015), 1440–1448. https://doi.org/10.1109/ICCV.2015.169
[7]	S. Ren, K. He, R. Girshick, J. Sun, Faster R-CNN: Towards real-time object detection with region proposal networks, IEEE Trans. Pattern Anal. Mach. Intell., 39 (2017), 1137–1149. https://doi.org/10.1109/TPAMI.2016.2577031 doi: 10.1109/TPAMI.2016.2577031
[8]	J. Redmon, S. Divvala, R. Girshick, A. Farhadi, You only look once: Unified, real-time object detection, in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2016), 779–788. https://doi.org/10.1109/CVPR.2016.91
[9]	J. Redmon, A. Farhadi, YOLO9000: Better, Faster, Stronger, in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2017), 6517–6525. https://doi.org/10.1109/CVPR.2017.690
[10]	J. Redmon, A. Farhadi, YOLOv3: an incremental improvement, arXiv preprint, (2018), arXiv: 1804.02767. http://arXiv.org/abs/1804.02767
[11]	A. Bochkovskiy, C. Y. Wang, H. Liao, YOLOv4: optimal speed and accuracy of object detection, arXiv preprint, (2020), arXiv: 2004.10934. http://arXiv.org/abs/2004.10934
[12]	G. Jocher, Yolov5, 2020. Available from: https://github.com/ultralytics/yolov5.
[13]	Z. Ge, S. Liu, F. Wang, Z. Li, J. Sun, YOLOX: Exceeding YOLO series in 2021, arXiv preprint, (2021), arXiv: 2107.08430. https://arXiv.org/abs/2107.08430
[14]	Y. Li, X. Liu, H. Zhang, X. Li, X. Sun, Optical remote sensing image retrieval based on convolutional neural networks (in Chinese), Opt. Precis. Eng., 26 (2018), 200–207. https://doi.org/10.3788/ope.20182601.0200 doi: 10.3788/ope.20182601.0200
[15]	A. Van Etten, You only look twice: Rapid multi-scale object detection in satellite imagery, arXiv preprint, (2018), arXiv: 1805.09512. https://doi.org/10.48550/arXiv.1805.09512
[16]	M. Ahmed, Y. Wang, A. Maher, X. Bai, Fused RetinaNet for small target detection in aerial images, Int. J. Remote Sens., 43 (2022), 2813–2836. https://doi.org/10.1080/01431161.2022.2071115 doi: 10.1080/01431161.2022.2071115
[17]	H. Liu, G. Yuan, L. Yang, K. Liu, H. Zhou, An appearance defect detection method for cigarettes based on C‐CenterNet, Electronics, 11 (2022), 2182. https://doi.org/10.3390/electronics11142182 doi: 10.3390/electronics11142182
[18]	S. Du, B. Zhang, P. Zhang, P. Xiang, H. Xue, FA-YOLO: An improved YOLO model for infrared occlusion object detection under confusing background, Wireless Commun. Mobile Comput., 2021 (2021). https://doi.org/10.1155/2021/1896029 doi: 10.1155/2021/1896029
[19]	A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, et al., MobileNets: Efficient convolutional neural networks for mobile vision applications, arXiv preprint, (2017), arXiv: 1704.04861. https://doi.org/10.48550/arXiv.1704.04861
[20]	M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, L. C. Chen, MobileNetV2: Inverted residuals and linear bottlenecks, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2018), 4510–4520. https://doi.org/10.1109/CVPR.2018.00474
[21]	A. Howard, M. Sandler, B. Chen, W. Wang, L. C. Chen, M. Tan, et al., Searching for MobileNetV3, in 2019 IEEE/CVF International Conference on Computer Vision (ICCV), (2019), 1314–1324. https://doi.org/10.1109/ICCV.2019.00140
[22]	X. Zhang, X. Zhou, M. Lin, J. Sun, ShuffleNet: An extremely efficient convolutional neural network for mobile devices, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2018), 6848–6856.
[23]	N. Ma, X. Zhang, H. T. Zheng, J. Sun, ShuffleNet V2: Practical guidelines for efficient CNN architecture design, in European Conference on Computer Vision (ECCV), (2018), 122–138. https://doi.org/10.1109/CVPR.2018.00716
[24]	RangiLyu, NanoDet-Plus: Super fast and high accuracy lightweight anchor-free object detection model, 2021. Available from: https://github.com/RangiLyu/nanodet.
[25]	C. Y. Wang, H. Liao, Y. H. Wu, P. Y. Chen, J. W. Hsieh, I. H. Yeh, CSPNet: A new backbone that can enhance learning capability of CNN, in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), (2020), 1571–1580. https://doi.org/10.1109/CVPRW50498.2020.00203
[26]	X. Luo, Y. Wu, L. Zhao, YOLOD: A target detection method for UAV aerial imagery, Remote Sens., 14 (2022), 3240. https://doi.org/10.3390/rs14143240 doi: 10.3390/rs14143240
[27]	D. Yan, G. Li, X. Li, H. Zhang, H. Lei, K. Lu, et al., An improved faster R-CNN method to detect tailings ponds from high-resolution remote sensing images, Remote Sens., 13 (2021), 2052. https://doi.org/10.3390/rs13112052 doi: 10.3390/rs13112052
[28]	F. C. Akyon, S. O. Altinuc, A. Temizel, Slicing aided hyper inference and fine-tuning for small object detection, in 2022 IEEE International Conference on Image Processing (ICIP), (2022), 966–970. https://doi.org/10.1109/ICIP46576.2022.9897990
[29]	L. Yang, G. Yuan, H. Zhou, H. Liu, J. Chen, H. Wu, RS-YOLOX: A high-precision detector for object detection in satellite remote sensing images, Appl. Sci., 12 (2022), 8707. https://doi.org/10.3390/app12178707 doi: 10.3390/app12178707
[30]	J. Liu, C. Liu, Y. Wu, Z. Sun, H. Xu, Insulators' identification and missing defect detection in aerial images based on cascaded YOLO models, Comput. Intell. Neurosci., 2022 (2022). https://doi.org/10.1155/2022/7113765 doi: 10.1155/2022/7113765
[31]	X. Li, Y. Qin, F. Wang, F. Guo, J. T. W. Yeow, Pitaya detection in orchards using the MobileNet-YOLO model, in 2020 39th Chinese Control Conference (CCC), (2020), 6274–6278. https://doi.org/10.23919/CCC50068.2020.9189186
[32]	Z. Tian, C. Shen, H. Chen, T. He, FCOS: Fully convolutional one-stage object detection, in 2019 IEEE/CVF International Conference on Computer Vision (ICCV), (2019), 9626–9635. https://doi.org/10.1109/ICCV.2019.00972
[33]	H. Law, J. Deng, CornerNet: Detecting objects as paired keypoints, Int. J. Comput. Vision, 128 (2020), 642–656. https://doi.org/10.1007/s11263-019-01204-1 doi: 10.1007/s11263-019-01204-1
[34]	G. Song, Y. Liu, X. Wang, Revisiting the sibling head in object detector, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2020), 11563–11572.
[35]	K. He, X. Zhang, S. Ren, J. Sun, Spatial pyramid pooling in deep convolutional networks for visual recognition, IEEE Trans. Pattern Anal. Mach. Intell., 37 (2015), 1904–1916. https://doi.org/10.1109/TPAMI.2015.2389824 doi: 10.1109/TPAMI.2015.2389824
[36]	L. C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, A. L. Yuille, DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs, IEEE Trans. Pattern Anal. Mach. Intell., 40 (2018), 834–848. https://doi.org/10.1109/TPAMI.2017.2699184 doi: 10.1109/TPAMI.2017.2699184
[37]	C. Y. Wang, A. Bochkovskiy, H. Liao, YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2023), 7464–7475.
[38]	H. Li, J. Li, H. Wei, Z. Liu, Z. Zhan, Q. Ren, Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles, arXiv preprint, (2022), arXiv: 2206.02424. https://doi.org/10.48550/arXiv.2206.02424
[39]	V. Dumoulin, F. Visin, A guide to convolution arithmetic for deep learning, arXiv preprint, (2018), arXiv: 1603.07285. https://doi.org/10.48550/arXiv.1603.07285
[40]	F. Yu, V. Koltun, T. Funkhouser, Dilated residual networks, in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2017), 636–644. https://doi.org/10.1109/CVPR.2017.75
[41]	Q. Wang, B. Wu, P. Zhu, P. Li, W. Zuo, Q. Hu, ECA-Net: Efficient channel attention for deep convolutional neural networks, in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2020), 11531–11539. https://doi.org/10.1109/CVPR42600.2020.01155
[42]	B. Jiang, R. Luo, J. Mao, T. Xiao, Y. Jiang, Acquisition of localization confidence for accurate object detection, in Proceedings of the European Conference on Computer Vision (ECCV), (2018), 784–799.
[43]	J. He, S. Erfani, X. Ma, J. Bailey, Y. Chi, X. S. Hua, Alpha-IoU: A family of power intersection over union losses for bounding box regression, in NeurIPS 2021 Conference, 2021.
[44]	H. Rezatofighi, N. Tsoi, J. Gwak, A. Sadeghian, I. Reid, S. Savarese, Generalized intersection over union: A metric and a loss for bounding box regression, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2019), 658–666. https://doi.org/10.1109/CVPR.2019.00075
[45]	Z. Zheng, P. Wang, W. Liu, J. Li, R. Ye, D. Ren, Distance-IoU loss: Faster and better learning for bounding box regression, in Proceedings of the AAAI Conference on Artificial Intelligence, 2020. https://doi.org/10.1609/aaai.v34i07.6999
[46]	Z. Gevorgyan, SIoU loss: More powerful learning for bounding box regression, arXiv preprint, (2022), arXiv: 2205.12740. https://doi.org/10.48550/arXiv.2205.12740
[47]	G. Cheng, P. Zhou, J. Han, Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images, IEEE Trans. Geosci. Remote Sens., 54 (2016), 7405–7415. https://doi.org/10.1109/TGRS.2016.2601622 doi: 10.1109/TGRS.2016.2601622
[48]	Y. Long, Y. Gong, Z. Xiao, Q. Liu, Accurate object localization in remote sensing images based on convolutional neural networks, IEEE Trans. Geosci. Remote Sens., 55 (2017), 2486–2498. https://doi.org/10.1109/TGRS.2016.2645610 doi: 10.1109/TGRS.2016.2645610
[49]	X. Lu, Y. Zhang, Y. Yuan, Y. Feng, Gated and axis-concentrated localization network for remote sensing object detection, IEEE Trans. Geosci. Remote Sens., 58 (2020), 179–192. https://doi.org/10.1109/TGRS.2019.2935177 doi: 10.1109/TGRS.2019.2935177
[50]	L. Yang, R. Y. Zhang, L. Li, X. Xie, SimAM: A simple, parameter-free attention module for convolutional neural networks, in Proceedings of the 38th International Conference on Machine Learning, 139 (2021), 11863–11874.
[51]	Z. Zhong, Z. Q. Lin, R. Bidart, X. Hu, I. B. Daya, Z. Li, et al., Squeeze-and-attention networks for semantic segmentation, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2020), 13065–13074.
[52]	R. Saini, N. K. Jha, B. Das, S. Mittal, C. K. Mohan, ULSAM: Ultra-lightweight subspace attention module for compact convolutional neural networks, in Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), (2020), 1616–1625. https://doi.org/10.1109/WACV45572.2020.9093341
[53]	Y. Liu, Z. Shao, Y. Teng, N. Hoffmann, NAM: Normalization-based attention module, arXiv preprint, (2021), arXiv: 2111.12419. https://doi.org/10.48550/arXiv.2111.12419
[54]	X. Ma, Yolo-Fastest: yolo-fastest-v1.1.0, 2021. Available from: https://github.com/dog-qiuqiu/Yolo-Fastest.
[55]	X. Ma, FastestDet: Ultra lightweight anchor-free real-time object detection algorithm, 2022. Available from: https://github.com/dog-qiuqiu/FastestDet.
[56]	X. Yang, J. Yan, Z. Feng, T. He, R3Det: Refined single-stage detector with feature refinement for rotating object, in Proceedings of the AAAI Conference on Artificial Intelligence, 35 (2021), 3163–3173. https://doi.org/10.1609/aaai.v35i4.16426
[57]	J. Han, J. Ding, J. Li, G. S. Xia, Align deep features for oriented object detection, IEEE Trans. Geosci. Remote Sens., 60 (2022), 1–11. https://doi.org/10.1109/TGRS.2021.3062048 doi: 10.1109/TGRS.2021.3062048
[58]	X. Xie, G. Cheng, J. Wang, X. Yao, J. Han, Oriented R-CNN for object detection, in 2021 IEEE/CVF International Conference on Computer Vision (ICCV), (2021), 3500–3509. https://doi.org/10.1109/ICCV48922.2021.00350
[59]	J. Ding, N. Xue, Y. Long, G. S. Xia, Q. Lu, Learning RoI transformer for oriented object detection in aerial images, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (2019), 2844–2853. https://doi.org/10.1109/CVPR.2019.00296
[60]	S. Zhong, H. Zhou, Z. Ma, F. Zhang, J. Duan, Multiscale contrast enhancement method for small infrared target detection, Optik, 271 (2022), 170134. https://doi.org/10.1016/j.ijleo.2022.170134 doi: 10.1016/j.ijleo.2022.170134
[61]	S. Zhong, H. Zhou, X. Cui, X. Cao, F. Zhang, J. Duan, Infrared small target detection based on local-image construction and maximum correntropy, Measurement, 211 (2023), 112662. https://doi.org/10.1016/j.measurement.2023.112662 doi: 10.1016/j.measurement.2023.112662

© 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)