Commutativity of spatiochromatic covariance matrices in natural image statistics

Yiye Jiang; Jérémie Bigot; Edoardo Provenzi; Yiye Jiang; Jérémie Bigot; Edoardo Provenzi

doi:10.3934/mine.2020016

Mathematics in Engineering

2020, Volume 2, Issue 2: 313-339. doi: 10.3934/mine.2020016

Previous Article Next Article

Research article

Commutativity of spatiochromatic covariance matrices in natural image statistics

Université de Bordeaux, CNRS, Bordeaux INP, IMB, UMR 5251, F-33400, Talence, France

Received: 18 April 2019 Accepted: 23 December 2019 Published: 17 February 2020

Statistic of natural images is a growing field of research both in vision and image processing. On the vision research side, fine statistical details about object distribution in real-world scenes help understanding the human visual system behavior. On the image processing side, by using the information gathered from statistics of natural scenes, we can obtain reliable priors and insights that can be used in many models. In has been rigorously proven in ^[16] that, if second order stationarity and commutativity of spatiochromatic covariance matrices hold true for natural scenes, then the codification of spatial and chromatic information by the human visual system can be separated through a tensor product. Spatial features are coded via local and oriented Fourier basis elements, while color features are coded via a triad given by an achromatic channel followed by two color opponent channels. In this paper, we will show that, while stationarity is guaranteed, commutativity is not. However, we shall see that commutativity of spatiochromatic covariance matrices can be approached if the database of images used to model visual scenes is modified accordingly to a suitable transformation that describes the response of retinal photoreceptors to light absorption: the Michaelis-Menten formula. A thorough investigation of the effects of a parameter of this formula will be performed and its influence on commutativity of covariance matrices will be detailed.

Keywords:

Citation: Yiye Jiang, Jérémie Bigot, Edoardo Provenzi. Commutativity of spatiochromatic covariance matrices in natural image statistics[J]. Mathematics in Engineering, 2020, 2(2): 313-339. doi: 10.3934/mine.2020016

Related Papers:

[1]	Monica Conti, Filippo Dell'Oro, Vittorino Pata . Exponential decay of a first order linear Volterra equation. Mathematics in Engineering, 2020, 2(3): 459-471. doi: 10.3934/mine.2020021
[2]	William R. B. Lionheart . Histogram tomography. Mathematics in Engineering, 2020, 2(1): 55-74. doi: 10.3934/mine.2020004
[3]	Gianluca Crippa, Christian Schulze . Sub-exponential mixing of generalized cellular flows with bounded palenstrophy. Mathematics in Engineering, 2023, 5(1): 1-12. doi: 10.3934/mine.2023006
[4]	Gabriel B. Apolinário, Laurent Chevillard . Space-time statistics of a linear dynamical energy cascade model. Mathematics in Engineering, 2023, 5(2): 1-23. doi: 10.3934/mine.2023025
[5]	Lauri Oksanen, Mikko Salo . Inverse problems in imaging and engineering science. Mathematics in Engineering, 2020, 2(2): 287-289. doi: 10.3934/mine.2020014
[6]	Patrizia Di Gironimo, Salvatore Leonardi, Francesco Leonetti, Marta Macrì, Pier Vincenzo Petricca . Existence of solutions to some quasilinear degenerate elliptic systems with right hand side in a Marcinkiewicz space. Mathematics in Engineering, 2023, 5(3): 1-23. doi: 10.3934/mine.2023055
[7]	Zeljko Kereta, Valeriya Naumova . On an unsupervised method for parameter selection for the elastic net. Mathematics in Engineering, 2022, 4(6): 1-36. doi: 10.3934/mine.2022053
[8]	Mattia Fogagnolo, Andrea Pinamonti . Strict starshapedness of solutions to the horizontal p-Laplacian in the Heisenberg group. Mathematics in Engineering, 2021, 3(6): 1-15. doi: 10.3934/mine.2021046
[9]	Michael Herrmann, Karsten Matthies . Solitary waves in atomic chains and peridynamical media. Mathematics in Engineering, 2019, 1(2): 281-308. doi: 10.3934/mine.2019.2.281
[10]	Ansgar Jüngel, Ulisse Stefanelli, Lara Trussardi . A minimizing-movements approach to GENERIC systems. Mathematics in Engineering, 2022, 4(1): 1-18. doi: 10.3934/mine.2022005

Abstract

1. Introduction

In ^[16] a mathematical result about the separability of spatial and chromatic features of natural images has been established under two hypothesis: Second order stationarity and commutativity of the so-called spatiochromatic covariance matrices. Among other consequences, this result guarantees the well-posedness of several image codification models.

The closest representation of physical irradiance of a visual scene can be obtained through multispectral and high dynamic range (HDR) techniques. Unfortunately, nowadays technology permits the acquisition of artifact-free multispectral and HDR images only for still-life scenes, thus constraining too much the available data-set that can be used for statistical purposes. For this reason, knowing that we are considering a rough approximation of the true irradiance, we were forced to build our datasets by using Raw images obtained via RGB cameras.

In ^[16] it has been shown that spatiochromatic covariance matrices of raw images do not commute perfectly. However, the commutativity properties improve considerably if the information carried by raw images is modified according to an important transformation called Michaelis-Menten formula that can be written like this: $\textbf{u}_\mu(x) \mapsto \textbf{u}_\mu^\gamma(x)/(\textbf{u}_\mu^\gamma(x) + m_\mu^\gamma)$ , where $\textbf{u}$ is the RGB raw image function, $x$ is the pixel location, $\mu$ is one of the RGB chromatic channel, so that $\textbf{u}_\mu(x)$ is the image intensity in the pixel $x$ and in the chromatic channel $\mu$ and $m$ represents the average value. More specifically, we want to investigate the influence of the parameter $\gamma$ on the commutativity.

The reason why we consider this formula instead of others is that it has been empirically discovered in ^[20] that it fits the behavior of retinal cones in the transduction processed from light radiance to neuronal electric potential. Thus, the transformed images after the application of the Michaelis-Menten formula can be considered as a good approximation of the first signal input for the visual chain of events that characterize the human visual system.

We stress that the mathematical theory developed in this paper is aimed at formalizing and extending the results of ^[16], with the hope that they may be useful for vision scientists to refine their models.

The paper is structured as follows. In section 2, we recall the most important information that we need from the state of the art about statistics of natural images in order to introduce our work. In section 3, we discuss an estimation strategy of spatiochromatic covariance matrices of images, followed by a measure of their commutativity. In section 4.1, we present the image database that we used in our research. Most importantly, we introduce a crucial pre-processing filtering tool in our procedure: the sky classifier. In section 5, we discuss some result about the commutativity change after the application of the Michaelis-Menten transformation. Finally, in section 6 we resume our contributions and put our work in perspective.

2. Spatiochromatic features of natural images

There is a general agreement about the fact that the Human Visual System (HVS from now on) has evolved in order to optimize the elaboration of visual signals coming from scenes of the natural world (which will be shorten with natural scenes from now on, following the traditional nomenclature). Attneave ^[1], MacKay ^[11] and Barlow ^[2] pioneered the idea that the HVS may optimize the processing of natural signals by performing a redundancy reduction, however they did not quantify these ideas with a computational theory that can provide a coding for natural images.

Two kind of redundancies can be distinguished in the interaction between humans and the natural environment: Firstly, natural scenes have plenty of homogeneous areas and the light reflected from spatial locations belonging to those areas will be interpreted as the same visual information, this implies a strong spatial correlation. Secondly, light signals are absorbed by the three $L, M, S$ -type cones in the retina, whose sensitivity is not independent because they are highly overlapping. This implies a strong chromatic correlation. When both effects are taken into account, one speaks about spatiochromatic correlation.

The literature about natural image statistics is vast and its exhaustive presentation is far beyond the scope of this paper. Here we will deal only with the statistical information gathered by considering the spatial representation of the image and emphasizing the results from ^[4] and ^[19], which are essential to understand the development of our paper.

Before describing the results of ^[4] and ^[19], let us recall that when principal component analysis (PCA) is performed on small natural image patches, the basic features that result are Fourier descriptors, see for instance ^[13]). This fact is a consequence of spatial stationarity.

2.1. Chromatic redundancy in natural images

The first statistical information about chromatic redundancy has been experimentally obtained in ^[12] in the framework of color segmentation of RGB images. For each picture of a database of 8 RGB images, the authors computed the covariance matrix $C$ of the distribution of the values of $R$ , $G$ and $B$ at each pixel. They found that the eigenvectors of the covariance matrix are approximately the following ones for each image of the database: $\textbf{v}_1 = \left(\frac{1}{3}, \frac{1}{3}, \frac{1}{3}\right)^t$ , $\textbf{v}_2 = \left(\frac{1}{2}, 0, -\frac{1}{2}\right)^t$ , $\textbf{v}_3 = \left(-\frac{1}{4}, \frac{1}{2}, -\frac{1}{4}\right)^t$ . These vectors correspond to the three following uncorrelated color features: $X_1 = \frac{R+G+B}{3}$ , $X_2 = \frac{R-B}{2}$ , $X_3 = \frac{2G-(R+B)}{4}$ .

This shows that the feature related to the largest variance is the luminance $X_1$ (or achromatic channel) and the other two features are described by the opponent channels $X_2$ (red-blue) and $X_3$ (green-violet).

Buchsbaum and Gottshalk approached in ^[4] the problem of finding uncorrelated color features from a purely theoretical point of view. They considered the abstract ensemble of all possible visual stimuli (radiances) ${\cal S}\equiv \{S(\lambda), \; \lambda \in {\cal L}\}$ , where $\cal L$ is the spectrum of visible wavelengths. From a given representative $S(\lambda) \in \cal S$ , a weighted integration of $S(\lambda)$ over the visual spectrum, with weights given by the Vos-Walraven spectral sensitivity functions $L(\lambda), M(\lambda), S(\lambda)$ , yields the three cone activation values $L = \int_{\cal L} S(\lambda) L(\lambda) \, d\lambda$ , $M = \int_{\cal L} S(\lambda) M(\lambda) \, d\lambda$ , $S = \int_{\cal L} S(\lambda) S(\lambda) \, d\lambda$ .

Assuming that the stimulus $S(\lambda)$ (coming from a fixed point $\bar x$ of a scene) is a random variable, a covariance matrix can be build from the three random variables $L, M, S$ . This matrix, called the chromatic covariance matrix is defined as:

$\begin{equation} C = \begin{bmatrix} C_{LL} & C_{LM} & C_{LS}\\ C_{ML} & C_{MM} & C_{MS}\\ C_{SL} & C_{SM} & C_{SS} \end{bmatrix}, \end{equation}$

(2.1)

where $C_{LL}\equiv \mathbb{E}[L\cdot L]-(\mathbb{E}[L])^2$ , $C_{LM}\equiv \mathbb{E}[L\cdot M]-\mathbb{E}[L]\mathbb{E}[M] = C_{ML}$ , and so on, $\mathbb E$ being the expectation operator.

Let $K(\lambda, \mu) = \mathbb{E}[S(\lambda)S(\mu)]-\mathbb{E}[S(\lambda)]\cdot \mathbb{E}[S(\mu)]$ be the covariance function, then the entries of the covariance matrix can be written as $C_{LL} = \iint_{{\cal L}^2} K(\lambda, \mu) L(\lambda) L(\mu) \, d\lambda d\mu$ , and so on.

To be able to perform explicit calculations, the analytical form of the covariance function $K(\lambda, \mu)$ must be specified. In the absence of a database of multispectral images, Buchsbaum and Gottschalk used abstract non-realistic data, i.e., they chose the easiest covariance function corresponding to visual stimuli maximally uncorrelated with respect to their energy at different wavelengths, i.e., $K(\lambda, \mu) = \delta(\lambda-\mu)$ , $\delta$ being the Dirac distribution. As the authors themselves observe, this condition is satisfied only if the ensemble $\cal S$ is made of monochromatic signals.

With this choice, the entries of the covariance matrix $C$ are all positives and they can be written as $C_{LL} = \int_{\cal L} L^2(\lambda) \, d\lambda$ , $C_{LM} = \int_{\cal L} L(\lambda)M(\lambda) \, d\lambda$ , and so on. $C$ is also real and symmetric, so it has three positive eigenvalues $\lambda_1 \geq \lambda_2 \geq \lambda_3$ with corresponding eigenvectors $\textbf{v}_i$ , $i = 1, 2, 3$ . If $W$ is the matrix whose columns are the eigenvectors of $C$ , i.e., $W = [\textbf{v}_1|\textbf{v}_2|\textbf{v}_3]$ , then the diagonalization of $C$ is given by $\Lambda = W^t C W = \text{diag}(\lambda_1, \lambda_2, \lambda_3)$ .

The eigenvector transformation of the cone excitation values $L, M, S$ , in the special case of monochromatic stimuli, is then

$\begin{equation} \begin{pmatrix} A(\lambda) \\ P(\lambda) \\ Q(\lambda) \end{pmatrix} = W^t \begin{pmatrix} L(\lambda) \\ M(\lambda) \\ S(\lambda) \end{pmatrix}. \end{equation}$

(2.2)

The transformed values $A, P, Q$ are uncorrelated and their covariance matrix is $\Lambda$ . $A$ is the achromatic channel, while $P$ and $Q$ are associated to the opponent chromatic channels.

The key point in Buchsbaum and Gottschalk's theory is the application of the Perron-Frobenius theorem (see e.g., ^[3] for more details), which assures that positive matrices, i.e., matrices whose entries are all strictly greater than zero, have one and only one eigenvector whose entries have all the positive sign, and this eigenvector corresponds to the largest eigenvalue, i.e., $\lambda_1$ . So, only the transformed $A$ channel will be a linear combination of the cone activation values $L, M, S$ with positive coefficients, while the channels $P$ and $Q$ will show opponency. This is the theoretical reason underlying the evidence of post-retinal chromatic opponent behavior, following Buchsbaum and Gottschalk.

Using the data obtained above, Buchsbaum and Gottschalk could write explicitly the transformation from $(L, M, S)$ to $(A, P, Q)$ as follows:

$\begin{equation} \begin{cases} A \simeq 0.887 L + 0.461 M \\ P \simeq -0.46 L + 0.88 M \\ Q = 0.004 L - 0.01 M + 0.99 S. \end{cases} \end{equation}$

(2.3)

More information about the relationship between Buchsbaum and Gottschalk's theory and other well-known color spaces can be found in ^[10].

2.2. Spatiochromatic redundancy in natural images

The most influential paper about spatiochromatic feature is ^[19], where Ruderman, Cronin and Chiao proposed a patch-based spatiochromatic coding and tested Buchsbaum-Gottschalk's theory on a database of 12 multispectral natural images of foliage. The authors have shown that the scatterplots in the $LM$ and $LS$ planes of the $L, M, S$ cone activations values (corresponding to 1000 pixels randomly selected in the database) show a high degree of correlation but also asymmetry. So, they decided to study these data by first reducing their asymmetry: They modified the $LMS$ values by taking their decimal logarithm and then they subtracted the average image value in the logarithmic domain. They obtained the so-called Ruderman-Cronin-Chiao coordinates, i.e., $\tilde L = \text{Log} \, L - \langle \text{Log} \, L \rangle$ , ${\tilde M} = \text{Log} \, M - \langle \text{Log} \, M \rangle$ and ${\tilde S} = \text{Log} \, S - \langle \text{Log} \, S \rangle$ . Following ^[19], if $\tilde L$ , $\tilde M$ , $\tilde S$ are the basis vectors in the logarithmically-transformed space, then the application of the PCA gives the following three principal axes:

$\begin{equation} \begin{cases} \ell = \frac{1}{\sqrt{3}} ({\tilde L}+{\tilde M}+{\tilde S}) \\ \alpha = \frac{1}{\sqrt{6}} ({\tilde L} + {\tilde M} -2{\tilde S})\\ \beta = \frac{1}{\sqrt{2}} ({\tilde L} - {\tilde M}). \end{cases} \end{equation}$

(2.4)

The color space spanned by these three principal axes is called $\ell\alpha \beta$ space.

The key point of the paper is the idea to study spatiochromatic features by considering $3 \times 3$ patches, with each pixel containing a 3-vector color information, so that every patch is converted in a vector with 27 components that were analyzed with the PCA. The principal axes of these small patches in the logarithmic space show that the first fluctuations are in the achromatic channel, followed by blue-yellow fluctuations in the $\alpha$ direction and red-green ones in the $\beta$ direction.

The spatial axes are largely symmetrical and can be represented by Fourier features, in line with the translation-invariance of natural images, as argued in ^[5]. It is important to stress that no pixel within the patches appear other than the primary gray, blue-yellow or red-green colors, i.e., no mixing of $\ell, \alpha, \beta$ has been found in any $3 \times 3$ patch. This means that not only the single-pixel principal axes $\ell, \alpha, \beta$ , but also the spatially-dependent principal axes $\ell(x), \alpha(x), \beta(x)$ , viewed as functions of the spatial coordinate $x$ inside the patches, are decorrelated. These results have been confirmed by ^[14].

2.3. Second order stationarity

Let us now analyze the consequence of second order stationarity in natural images on their decorrelated spatiochromatic features. For the sake of clarity, we will first start with the simplest case of gray-level images, where stationarity implies that the principal components are Fourier basis functions. We will then extend this result to the color case and show that a supplementary hypothesis on color covariance matrices yields principal components given by the tensor product between Fourier basis functions on one side, and achromatic plus opponent color coordinates on the other.

2.4. The gray-level case

Let $I$ be a gray-level natural image of dimension $W \times H$ . We denote the $H$ rows of $I$ as $r^0, \ldots, r^{H-1}$ and the position of each pixel of $I$ row-wise as follows^* :

^* To avoid cumbersome repetitions of the indexes variability, from now on, we will suppose that $j, j' \in \{0, \ldots H-1\}$ and $k, k' \in \{0, \ldots W-1\}$ , unless otherwise specified.

$\begin{equation} I = \{r^j_k; \; j = 0, \ldots, H-1, \; k = 0, \ldots, W-1\}. \end{equation}$

(2.5)

Each row $r^j = (r^j_0, \ldots, r^j_{W-1})$ will be interpreted as a $W$ -dimensional random vector and each component $r^j_k$ as a random variable.

Let us define the spatial covariance of the two random variables $r^j_k$ , $r^{j'}_{k'}$ :

$\begin{equation} \mbox{cov}(r^j_k, r^{j'}_{k'}) \equiv c^{j, j'}_{k, k'} = {\mathbb E}[r^j_k r^{j'}_{k'}] - {\mathbb E}[r^j_k] {\mathbb E}[r^{j'}_{k'}]. \end{equation}$

(2.6)

Due to the symmetry of covariance we have $c^{j, j'}_{k, k'} = c^{j', j}_{k', k}$ . We write the spatial covariance matrix of the two random vectors $r^j$ , $r^{j'}$ as $\mbox{cov}(r^j, r^{j'})\equiv C^{j, j'}$ , where $C^{j, j'}$ is the $W \times W$ matrix:

$\begin{equation} C^{j, j'} = \begin{bmatrix} c^{j, j'}_{0, 0} & c^{j, j'}_{0, 1} & \cdots & c^{j, j'}_{0, W-1} \\ c^{j, j'}_{1, 0} & c^{j, j'}_{1, 1} & \cdots & c^{j, j'}_{1, W-1} \\ \vdots & \vdots & \ddots & \vdots \\ c^{j, j'}_{W-1, 0} & \cdots & \cdots & c^{j, j'}_{W-1, W-1} \end{bmatrix}. \end{equation}$

(2.7)

Finally, the spatial covariance matrix $C$ of the image $I$ can be written as:

$\begin{equation} C = \begin{bmatrix} C^{0, 0} & C^{0, 1} & \cdots & C^{0, H-1} \\ C^{1, 0} & C^{1, 1} & \cdots & C^{1, H-1} \\ \vdots & \vdots & \ddots & \vdots \\ C^{H-1, 0} & \cdots & \cdots & C^{H-1, H-1} \end{bmatrix}. \end{equation}$

(2.8)

Notice that $C$ is a $HW \times HW$ matrix because each sub-matrix $C^{j, j'}$ is a $W \times W$ matrix.

Hypothesis 1. From now on, the covariance of $I$ is assumed to be invariant under translations of the row and column index, i.e., $c^{j, j'}_{k, k'} = c^{\vert j-j'\vert}_{\vert k-k' \vert}$ .

Hypothesis 1 is weaker than the typical definition of second order stationarity because here we do not assume the translation invariance of the mean. Alongside this hypothesis, we add a technical requirement on the geometry of digital images which is implicitly assumed every time the Fourier transform is considered, i.e., we will consider a symmetrized spatial domain with a toroidal distance, which means that we will perform the identification $r^j_k = r^{j'}_{k'}$ when $j\equiv j'$ (mod $H$ ) and $k\equiv k'$ (mod $W$ ), i.e., every time there exist $a, b \in \mathbb Z$ such that $j'-j = aH$ and $k'-k = bW$ .

As a covariance matrix, $C$ is real, symmetric and positive-definite. Now, as a consequence of the previous hypotheses, the matrix $C$ is also block-circulant with circulant blocks. Indeed, the $C^{j, j'}$ are circulant matrices, i.e., matrices where each row vector is rotated one element to the right relative to the preceding row vector^†, or, equivalently, each column vector is rotated one element down with respect to the preceding column vector. If we use the convenient shorthand notation 'circ $(\; )$ ' to denote a circulant matrix, by specifying only the first row (or, equivalently, the first column, due to symmetry) between the round brackets, then $C^{j, j'}$ can be written as follows:

^† This can be easily verified by noticing that $c^{j, j'}_{k, k'} = c^{j, j'}_{k+1, k'+1}$ .

$\begin{equation} C^{j, j'} = {\rm circ}\left(c^{j, j'}_{0, 0}, c^{j, j'}_{0, 1}, \ldots, c^{j, j'}_{0, W-1}\right). \end{equation}$

(2.9)

Now, if we write $C^j\equiv C^{0, j}$ , $j = {0, \ldots, H-1}$ it is straightforward to see that the covariance matrix $C$ is block-circulant and can be explicitly written as:

$\begin{equation} C = {\rm circ}\left(C^{0}, C^{1}, \ldots, C^{H-1}\right). \end{equation}$

(2.10)

It is well known that an $n\times n$ circulant matrix has $n$ eigenvalues corresponding to the components of the DFT of the finite sequence given by the first row of the matrix itself, and its eigenvectors are the Fourier basis functions, see e.g., ^[6] or ^[8].

Let us apply this general result to the $W\times W$ circulant matrices $C^j$ . The set of eigenvalue equations $C^j\textbf{e}_m = \lambda^ j_m \textbf{e}_m$ , $\lambda^j \in \mbox{ $ \mathbb{C} $ }$ and $\textbf{e}\in \mbox{ $ \mathbb{C} $ }^W$ , $m = 0, \ldots, W-1$ , can be written as the following matrix equation $C^jE_W = \Lambda^j E_W$ , where^‡:

^‡ We have used the simplified notation $c^{j}_{m}\equiv c^{0, j}_{0, m}$ to denote the matrix element of position $m$ in the first row of the matrix $C^j\equiv C^{0, j}$ , $m = 0, \ldots, W-1$ .

$\begin{equation} \Lambda^{j} = \sqrt{W}\text{diag}(\hat{c}^j_m;\; m = 0, \ldots, W-1), \quad \hat{c}^j_m = \frac{1}{\sqrt{W}}\sum\limits_{k = 0}^{W-1} c^j_k e^{-\frac{2\pi imk}{W}}, \end{equation}$

(2.11)

and $E_W$ is the Vandermonde matrix which implements the DFT, i.e., the so-called Sylvester matrix :

$\begin{equation} \begin{split} E_W & = \left[\textbf{e}_0 | \textbf{e}_1 |\cdots |\textbf{e}_{W-1}\right] \\ & = \left[\textbf{e}_m = \frac{1}{\sqrt{W}}\left(1, e^{-\frac{2\pi i m}{W}}, \ldots, e^{-\frac{2\pi i m(W-1)}{W}} \right)^t\right]_{m = 0, \ldots, W-1} \\ & = \frac{1}{\sqrt{W}} \begin{bmatrix} 1 & 1 & \cdots & 1 \\ 1 & e^{-\frac{2\pi i}{W}} & \cdots & e^{-\frac{2\pi i(W-1)}{W}} \\ \vdots & \vdots & \ddots & \vdots \\ 1 & e^{-\frac{2\pi i(W-1)}{W}} & \cdots & e^{-\frac{2\pi i(W-1)^2}{W}} \end{bmatrix}. \end{split} \end{equation}$

(2.12)

The following remark will help us understanding how to extend the previous diagonalization procedure to the whole matrix $C$ .

Remark 1. Let $M = \text{circ}(M^0, \ldots, M^{H-1})$ be a block-circulant matrix and let us assume that the blocks $M^j$ can be diagonalized on the same basis $B$ . If we write $E_H = \left[\textbf{e}_0 | \textbf{e}_1 |\cdots |\textbf{e}_{H-1}\right]$ , with the vectors $\textbf{e}_j$ defined as in Eq. (2.12) for all $j = 0, \ldots, H-1$ , then it can be verified by direct computation that $E_H \otimes B$ is a basis of eigenvectors of $M$ , where $\otimes$ denotes the Kronecker product.

In the case of our spatial covariance matrix $C$ , all the submatrices $C^j$ have the same basis of eigenvectors $E_W$ , thus the result stated in the previous remark can be directly applied on the matric $C$ to guarantee that

$\begin{equation} E_H \otimes E_W = \left[\textbf{e}_{m, l} = \frac{1}{\sqrt{HW}}\left(1, e^{-2\pi i\left(\frac{m}{W}+\frac{l}{H}\right)}, \ldots, e^{-2\pi i\left(\frac{m(W-1)}{W}+\frac{l(H-1)}{H}\right)} \right)^t\right]_{m, l}, \end{equation}$

(2.13)

for $m = 0, \ldots, W-1$ , and $l = 0, \ldots, H-1$ provides a basis of eigenvectors for the matrix $C$ .

Actually, due to the symmetry of covariance matrices, the complex parts of the exponentials cancel out (see ^[8]) and so the 2D cosine Fourier basis also constitutes a basis of eigenvectors of $C$ :

$\begin{equation} \textbf{e}_{m, l} = \frac{1}{\sqrt{HW}}\left(1, \cos\left(2\pi \left(\frac{m}{W}+\frac{l}{H}\right)\right), \ldots, \cos\left(2\pi \left(\frac{m(W-1)}{W}+\frac{l(H-1)}{H}\right)\right) \right)^t. \end{equation}$

(2.14)

2.5. The color case

Let us consider now an RGB image function $\textbf{u}:\Omega \to [0, 255]^3$ , where $\Omega$ is the spatial domain, and, for all $(j, k)\in \Omega$ , $\textbf{u}(j, k) = (R(j, k), G(j, k), B(j, k))$ is the vector whose components are the red, green and blue intensity values of the pixel defined by the coordinates $(j, k)$ .

We define the spatiochromatic covariance matrix among two pixels of position $(j, k)$ and $(j', k')$ by extending Eq. (2.6) as follows:

$\begin{equation} \textbf{c}^{j, j'}_{k, k'} = \begin{bmatrix} c^{j, j'}_{k, k'}(R, R) & c^{j, j'}_{k, k'}(R, G) & c^{j, j'}_{k, k'}(R, B) \\ c^{j, j'}_{k, k'}(G, R) & c^{j, j'}_{k, k'}(G, G) & c^{j, j'}_{k, k'}(G, B) \\ c^{j, j'}_{k, k'}(B, R) & c^{j, j'}_{k, k'}(B, G) & c^{j, j'}_{k, k'}(B, B) \end{bmatrix} \end{equation}$

(2.15)

where we defined $c^{j, j'}_{k, k'}(R, R) = {\mathbb E}[R(j, k) R(j', k')] - {\mathbb E}[R(j, k)] {\mathbb E}[R(j', k')]$ , $c^{j, j'}_{k, k'}(R, G) = {\mathbb E}[R(j, k) G(j', k')] - {\mathbb E}[R(j, k)] {\mathbb E}[G(j', k')]$ , and similarly for the remaining matrix elements. Of course the matrix $\textbf{c}^{j, j'}_{k, k'}$ is symmetric because $c^{j, j'}_{k, k'}(G, R) = {\mathbb E}[G(j, k) R(j', k')] - {\mathbb E}[G(j, k)] {\mathbb E}[R(j', k')] = c^{j, j'}_{k, k'}(R, G)$ , and similarly for all the other off-diagonal elements.

Naturally, we can extend Hypothesis 1 to RGB image case, as follows.

Hypothesis 1 (RGB case). The spatiochromatic covariance of u of the chromatic channels $\mu, \nu$ is assumed to be invariant under translations of row and column index, i.e., $c^{j, j'}_{k, k'}(\mu, \nu) = c^{|j-j'|}_{|k-k'|}(\mu, \nu)$ , for all $\mu, \nu \in { R, G, B}$ .

With the same technical requirements, we know that $C(\mu, \nu)$ is also block-circulant with circulant blocks.

In the particular case defined by $j' = j$ and $k' = k$ , we will call $\textbf{c}^{j, j'}_{k, k'}$ 'chromatic autocovariance' and denote it simply as $\textbf{c}^0$ . Notice that the matrix analyzed in ^[4] is the chromatic autocovariance of the LMS values.

We then define the spatiochromatic covariance matrix $\textbf{C}^{j, j'}$ among the two random vectors $r^j$ , $r^{j'}$ given by the $j$ -th and $j'$ -the rows of the spatial support of ${\bf u}$ by extending Eq. (2.7) as follows:

$\begin{equation} \textbf{C}^{j, j'} = \begin{bmatrix} \textbf{c}^{j, j'}_{0, 0} & \textbf{c}^{j, j'}_{0, 1} & \cdots & \textbf{c}^{j, j'}_{0, W-1} \\ \textbf{c}^{j, j'}_{1, 0} & \textbf{c}^{j, j'}_{1, 1} & \cdots & \textbf{c}^{j, j'}_{1, W-1} \\ \vdots & \vdots & \ddots & \vdots \\ \textbf{c}^{j, j'}_{W-1, 0} & \cdots & \cdots & \textbf{c}^{j, j'}_{W-1, W-1} \end{bmatrix}. \end{equation}$

(2.16)

Finally, we define the spatiochromatic covariance matrix $\textbf{C}$ of the RGB image $\textbf{u}$ by extending Eq. (2.8) to the $3HW \times 3HW$ matrix defined in this way:

$\begin{equation} C = \begin{bmatrix} \textbf{C}^{0, 0} & \textbf{C}^{0, 1} & \cdots & \textbf{C}^{0, H-1} \\ \textbf{C}^{1, 0} & \textbf{C}^{1, 1} & \cdots & \textbf{C}^{1, H-1} \\ \vdots & \vdots & \ddots & \vdots \\ \textbf{C}^{H-1, 0} & \cdots & \cdots & \textbf{C}^{H-1, H-1} \end{bmatrix}. \end{equation}$

(2.17)

Now, supposing that all the elements of the matrices (2.15) are positive, thanks to the Perron-Frobenius theorem we can assure that each of these $\textbf{c}^{j, j'}_{k, k'}$ matrices has a basis of eigenvectors that can be written as a triad of achromatic plus opponent chromatic channels. {If we further assume that the matrices (2.15) can be diagonalized on the same basis of eigenvectors $(A, P, Q)$ , then, thanks to Remark 1, we have that the eigenvectors of the spatiochromatic covariance matrix $C(R, G, B)$ can be written as the following Kronecker product:}

$\begin{equation} (A, P, Q)\otimes \textbf{e}_{m, l} \in \mbox{$ \mathbb{R} $}^{3HW}, \end{equation}$

(2.18)

which is precisely the type of eigenvectors that have been exhibited experimentally in ^[18]. A standard result of linear algebra guarantees that a set of matrices can be diagonalized on the same basis of eigenvectors if and only if they commute^§. Thanks to the hypothesis of translation invariance of covariance, this is verified if and only if the generic covariance matrix $\textbf{c}^{j, j'}_{k, k'}$ commutes with the chromatic autocovariance matrix $\textbf{c}^0$ .

^§ We recall that, given two generic matrices $A$ and $B$ for which the products $AB$ and $BA$ is well defined, $[A, B]\equiv AB-BA$ is called the 'commutator' between them. Of course $A$ and $B$ commute if and only if $[A, B] = 0$ .

The following proposition holds true ^[16].

Proposition 1. Let $\textbf{u}:\Omega \to [0, 255]^3$ be an RGB image function, with a periodized spatial domain $\Omega$ , and suppose that

1. All matrices $\textbf{c}^{j, j'}_{k, k'}$ are positive, i.e., their elements are strictly greater than 0;

2. The spatiochromatic covariance matrices $\textbf{c}^{j, j'}_{k, k'}$ defined in (2.15) depend only on the distances $|j-j'|$ , $|k-k'|$ , i.e., the covariance of $\textbf{u}$ is stationary;

3. The following commutation property holds:

$\begin{equation} [\textbf{c}^0, \textbf{c}^{j, j'}_{k, k'}] = 0 \qquad \forall (j, k), (j', k')\in \Omega. \end{equation}$

(2.19)

Then, the eigenvectors of the spatiochromatic covariance matrix $\textbf{C}$ defined in (2.17) can be written as the Kronecker product $(A, P, Q)\otimes \textbf{e}_{m, l}$ , where $(A, P, Q)$ is the achromatic plus opponent color channels triad and $\textbf{e}_{m, l}$ is the 2D cosine Fourier basis defined in Eq. (2.14).

Proposition 1 defines a mathematical framework where the empirical result of Ruderman et al. can be formalized and understood in terms of statistical properties of natural images. In ^[16], the hypotheses of the proposition above have been checked thanks to simulations performed on databases of natural images: The first two hypotheses have been confirmed, while the third will be discussed in detail in the following part of the paper.

2.6. Commutativity and exponential decay of covariance matrices

The experiments conducted in ^[16] empirically discovered a linear relationship in the semi-logarithmic scale between spatiochromatic covariance $\textbf{c}^{j, j'}_{k, k'}(\mu, \nu)$ and pixel distance $d = \sqrt{(j-j')^2 + (k-k')^2}$ :

$\begin{equation} \log(\textbf{c}_{\mu\nu}^d) = \alpha_{\mu\nu} + \beta_{\mu\nu}d. \end{equation}$

(2.20)

This, of course, implies an exponential decay for spatiochromatic covariance that corrected the power-law decay that was commonly supposed to hold true. These analytic expressions will allow us performing a theoretical analysis of the covariance estimators $\hat{\textbf{c}}_{\mu\nu}^d$ , which will play an important role in the discussion of commutativity.

It must be underlined that the exponential decay of $\textbf{c}_{\mu\nu}^d$ holds true with a great amount of precision only for an intermediate pixel distance range and it slightly deviates from it when $d$ is very small or very large. These deviations are to be expected, because of two different reasons. When $d$ is small, noise and the convolution kernel used by image sensors ^[15] introduce non linearity between irradiance and pixel intensities; when $d$ is large, the sensor response is altered by optical phenomena as vignetting ^[7] or an incorrect camera aperture.

This is the reason why, in papers dealing with databases of natural images (see, e.g., ^[19]), it is common to consider a reduced range of distances to compute statistical features. We will follow the same convention while dealing with experiments. However, for the theoretical part of the paper, we will allow the validity of the exponential decay for any distance $d\geq 0$ .

3. Estimation of spatiochromatic covariances

In this section, we will propose a reliable method to estimate the spatiochromatic covariances and then we will analyze their properties.

3.1. Construction of covariance estimators

Let us start by introducing some notation and nomenclature. $\textbf{u}_n:\Omega \to [0, 1]^3$ , $n = 1, 2, \ldots, N$ , is the $n$ -th RGB image function with spatial support $\Omega$ (common to all images). For each fixed pixel location $x\in \Omega$ , $\textbf{u}_n(x) = (R_n(x), G_n(x), B_n(x))$ is the RGB value of $x$ in the three chromatic channels. The values $\{\textbf{u}_n(x), \; x\in \Omega\}_{n = 1, \ldots, N}$ are i.i.d. image samples with finite population mean and covariance. First of all, we compute the average image ${{\bf{\bar u}}} = \frac{1}{N}\sum\limits_{n = 1}^N \textbf{u}_n = (\bar R, \bar G, \bar B)$ and subtract it from each image to get centered images $\tilde{\textbf{u}}_n = \textbf{u}_n - {{\bf{\bar u}}} = (R_n - \bar{R}, G_n-\bar{G}, B_n-\bar{B})$ . The $\mu$ -channel of the $n$ -th centered image will be written as $\tilde{\textbf{u}}_{\mu, n}$ .

The main task that we must perform is to estimate the coefficients $\beta_{\mu\nu}$ in Eq. (2.20). For this, we decided to use the classical ordinary least squares (OLS) estimation ^[17], which requires the samples in the regression model to be independent. However, in practice, we face the problem that the number of raw images that we have at disposal in our database (and, in general, in the databases publicly available) is not sufficient to provide enough independent samples.

To have a quantitative idea, the final image samples that we have for empirical studies of covariance are 701, suppose that we build a regression model according to the exponential decay and we fit it with the OLS estimation. Suppose also that the distance $d$ ranges from 0 to 100 with step 1 and $\mu, \nu \in \{R, G, B\}$ , this implies the need of 909 covariance independent estimators given by $\log(\hat{\textbf{c}}_{\mu\nu}^d)$ .

Let us describe how we have computed the estimators accordingly to the OLS prescriptions and overcame this problem. In each centered image, we sample $P$ location pairs given by a center and its neighbor with a fixed step size $s$ .

Centers are fixed in each image and their locations have coordinates

$\left\{(j_p, k_p), \; j_p = 0, s, 2s, \ldots, \left[\frac{H-1}{s}\right] s, \; k_p = 0, s, 2s, \ldots, \left[\frac{W-1}{s}\right] s, \; p = 1, \ldots, P\right\},$

where, $P = \left(\left[\frac{H-1}{s}\right] + 1\right)\left(\left[\frac{W-1}{s}\right] + 1\right)$ , and $\left[\right]$ takes the floor. Neighbours are also fixed throughout all images, while we do not restrict their locations as long as their distance from the centers remains $d$ , for each fixed value of $d$ . We write their locations as $(j'_p, k'_p)$ , with the same range variability as $(j_p, k_p)$ .

We notice that the set of centers can be identified with a downsampled version of the original image, so that we can consider the stationary hypothesis and its consequences to hold true also for the set of centers.

Finally, we estimate $\textbf{c}_{\mu\nu}^d$ as follows:

$\begin{equation} \hat{\textbf{c}}_{\mu\nu}^d = \frac{\sum\limits_{n = 1}^N \sum\limits_{p = 1}^P \tilde{\textbf{u}}_{\mu, n}(j_p, k_p) \tilde{\textbf{u}}_{\nu, n}(j'_p, k'_p)}{(N-1)P}. \end{equation}$

(3.1)

To simplify the notation, let us write $x_{np}^\mu = \tilde{\textbf{u}}_{\mu, n}(j_p, k_p)$ for the centers and $y_{np}^\nu = \tilde{\textbf{u}}_{\nu, n}(j'_p, k'_p)$ for the neighbors.

Finally, the estimators of spatiochromatic covariance matrices are:

$\begin{equation} \hat{\textbf{c}}^d = \begin{bmatrix} \hat{\textbf{c}}_{RR}^d & \hat{\textbf{c}}_{RG}^d & \hat{\textbf{c}}_{RB}^d \\ \hat{\textbf{c}}_{GR}^d & \hat{\textbf{c}}_{GG}^d & \hat{\textbf{c}}_{GB}^d \\ \hat{\textbf{c}}_{BR}^d & \hat{\textbf{c}}_{BG}^d & \hat{\textbf{c}}_{BB}^d \end{bmatrix}. \end{equation}$

(3.2)

We resume this construction in the scheme visualized in Figures 1 and 2.

Figure 1. Sampling strategy.

DownLoad: Full-Size Img PowerPoint

Figure 2. Construction of spatiochromatic covariance estimators.

DownLoad: Full-Size Img PowerPoint

3.2. Properties of $\hat{\textbf{c}}_{\mu\nu}^d$

Let us now discuss the properties of the estimators that we have built in the previous section. First of all, $\hat{\textbf{c}}_{\mu\nu}^d$ are unbiased: in fact $\mathbb{E}(\tilde{\textbf{u}}_n) = 0$ , and $\mbox{cov}(\tilde{\textbf{u}}_n) = \frac{N-1}{N} \mbox{cov}(\textbf{u}_n)$ , thus $\mathbb{E}(\hat{\textbf{c}}_{\mu\nu}^d) = \frac{\sum_{n = 1}^N \sum_{p = 1}^P E(x_{np}^u y_{np}^v)}{(N-1)P} = \textbf{c}_{\mu\nu}^d$ .

Then, we observe that, during the construction process to compute the estimators, we will keep introducing covariance values. Nevertheless, thanks to the covariance exponential decay, this accumulation will not cause a large variance of the estimators. In fact, we can prove that it exists an upper bound for $\mbox{cov}(\textbf{c}_{\mu\nu}^d, \textbf{c}_{\mu'\nu'}^{d'})$ , where $\mu, \nu, d$ and $\mu', \nu', d'$ are two different set of parameters, and that this quantity decreases with $P$ , the number of samples.

In order to do that, we first notice that, by direct computation, it can be verified that if $C$ is a block-circulant matrix with circulant blocks, i.e., $C^{j, j'} = \text{circ}(c^{j, j'}_{0, 0}, c^{j, j'}_{0, 1}, ..., c^{j, j'}_{0, W-1})$ and $C = \text{circ}(C^0, C^1, ..., C^{H-1})$ , then $\sum\limits_{j, j' = 0}^{H-1}\sum\limits_{k, k' = 0}^{W-1} c^{j, j'}_{k, k'} = HW\sum\limits_{j = 0}^{H-1}\sum\limits_{k = 0}^{W-1} c^{0, j}_{0, k}$ .

Since the centers $\{x^\mu_{np}\}_p$ constitute a downsampled version of the image $\textbf{u}_{\mu, n}$ , its spatiochromatic covariance is also endowed with properties mentioned in section 2, as well as the one above, thus

$\begin{equation} \begin{aligned} \mbox{cov}(\hat{\textbf{c}}_{\mu\nu}^d, \hat{\textbf{c}}_{\mu'\nu'}^{d'}) & = \mbox{cov}\left(\frac{\sum\limits_{n = 1}^N \sum\limits_{p = 1}^P x_{np}^\mu y_{np}^\nu}{(N-1)P}, \frac{\sum\limits_{n' = 1}^N \sum\limits_{p' = 1}^P x_{n'p'}^{\mu'} y_{n'p'}^{\nu'}}{(N-1)P} \right)\\ & = \frac{1}{(N-1)^2 P^2}\sum\limits_{n = 1}^N\sum\limits_{p = 1}^P \sum\limits_{p' = 1}^P \mathbb{E}(x_{np}^\mu y_{np}^\nu x_{np'}^{\mu'} y_{np'}^{\nu'}) - \frac{1}{N} \textbf{c}^d_{\mu\nu}\textbf{c}^{d'}_{\mu'\nu'}, \end{aligned} \end{equation}$

(3.3)

thus

$\begin{equation} | \mbox{cov}(\hat{\textbf{c}}_{\mu\nu}^d, \hat{\textbf{c}}_{\mu'\nu'}^{d'})| \leq \frac{1}{(N-1)^2 P^2}\sum\limits_{n = 1}^N\sum\limits_{p = 1}^P \sum\limits_{p' = 1}^P |\mathbb{E}(x_{np}^\mu y_{np}^\nu x_{np'}^{\mu'} y_{np'}^{\nu'})| + \frac{1}{N} |\textbf{c}^d_{\mu\nu}| |\textbf{c}^{d'}_{\mu'\nu'}|. \end{equation}$

(3.4)

Moreover, since $y_{np}^{\mu} \leq 1$ and $y_{np'}^{\nu'} \leq 1$ , then $|\mathbb{E}(x_{np}^\mu y_{np}^\nu x_{np'}^{\mu'} y_{np'}^{\nu'})| \leq |\mathbb{E}(x_{np}^\mu x_{np'}^{\mu'})| = \frac{N-1}{N} \textbf{c}^{\text{dist}[(j_p, k_p), (j_{p'}, k_{p'})]}_{\mu\mu'}$ .

Therefore $\sum\limits_{p = 1}^P \sum\limits_{p' = 1}^P |\mathbb{E}(x_{np}^\mu y_{np}^\nu x_{np'}^{\mu'} y_{np'}^{\nu'})| \leq \frac{N-1}{N}\sum\limits_{p = 1}^P \sum\limits_{p' = 1}^P \textbf{c}^{\text{dist}[(j_p, k_p), (j_{p'}, k_{p'})]}_{\mu\mu'}$ . From the remark above about circulant block matrices and from Hypothesis 1 (RGB case), it follows that:

$\begin{equation} \begin{aligned} \sum\limits_{p = 1}^P \sum\limits_{p' = 1}^P \textbf{c}^{\text{dist}[(j_p, k_p), (j_{p'}, k_{p'})]}_{\mu\mu'} & = P \sum\limits_{p = 1}^P \textbf{c}^{\text{dist}[(j_1, k_1), (j_{p}, k_{p})]}_{\mu\mu'}\\ & = P e^{\alpha_{\mu\mu'}} \sum\limits_{l = 0}^{\left[\frac{H-1}{s}\right]}\sum\limits_{m = 0}^{\left[\frac{W-1}{s}\right]} e^{\beta_{\mu\mu'} s\sqrt{l^2 + m^2} }. \end{aligned} \end{equation}$

(3.5)

Since $\beta_{\mu\nu} < 0$ , then:

$\begin{equation} \begin{aligned} \sum\limits_{l = 0}^{\left[\frac{H-1}{s}\right]}\sum\limits_{m = 0}^{\left[\frac{W-1}{s}\right]} e^{\beta_{\mu\mu'} s\sqrt{l^2 + m^2}} &\leq \sum\limits_{l = 0}^{\left[\frac{H-1}{s}\right]}\sum\limits_{m = 0}^{\left[\frac{W-1}{s}\right]} e^{\beta_{\mu\mu'} s\sqrt{l^2 + 0}}\\ & = \sum\limits_{l = 0}^{\left[\frac{H-1}{s}\right]} e^{\beta_{\mu\mu'} sl}\left(\left[\frac{W-1}{s}\right] + 1\right) \\ & = \frac{1- e^{\beta_{\mu\mu'}s \left(\left[\frac{H-1}{s}\right] + 1\right)}}{1- e^{\beta_{\mu\mu'} s}}\left(\left[\frac{W-1}{s}\right] + 1\right). \end{aligned} \end{equation}$

(3.6)

Moreover, since $P = \left(\left[\frac{H-1}{s}\right] + 1\right)\left(\left[\frac{W-1}{s}\right] + 1\right)$ , we can get:

$\begin{equation} \begin{aligned} \frac{1}{P} \sum\limits_{l = 0}^{\left[\frac{H-1}{s}\right]}\sum\limits_{m = 0}^{\left[\frac{W-1}{s}\right]} e^{\beta_{\mu\mu'} s\sqrt{l^2 + m^2}} &\leq \frac{1- e^{\beta_{\mu\mu'}s \left(\left[\frac{H-1}{s}\right] + 1\right)}}{(1- e^{\beta_{\mu\mu'} s})\left(\left[\frac{H-1}{s}\right] + 1\right)}, \end{aligned} \end{equation}$

(3.7)

therefore:

$\begin{equation} \begin{aligned} | \mbox{cov}(\hat{\textbf{c}}_{\mu\nu}^d, \hat{\textbf{c}}_{\mu'\nu'}^{d'})| & \leq \frac{e^{\alpha_{\mu\mu'}}}{(N-1)} \frac{1- e^{\beta_{\mu\mu'}s \left(\left[\frac{H-1}{s}\right] + 1\right)}}{(1- e^{\beta_{\mu\mu'} s})\left(\left[\frac{H-1}{s}\right] + 1\right)} + \frac{1}{N} |\textbf{c}^d_{\mu\nu}\textbf{c}^{d'}_{\mu'\nu'}|, \end{aligned} \end{equation}$

(3.8)

but $\frac{H-1}{s} - 1 < \left[\frac{H-1}{s}\right] \leq \frac{H-1}{s}$ , thus, if we plug the previous inequality into the upper bound in Eq. (3.8), then we get:

$\begin{equation} \begin{aligned} | \mbox{cov}(\hat{\textbf{c}}_{\mu\nu}^d, \hat{\textbf{c}}_{\mu'\nu'}^{d'})| & \lt \frac{e^{\alpha_{\mu\mu'}}}{(N-1)} \frac{1- e^{\beta_{\mu\mu'} (H-1 + s)}}{(1- e^{\beta_{\mu\mu'} s})\left(\frac{H-1}{s}\right)} + \frac{1}{N} |\textbf{c}^d_{\mu\nu}\textbf{c}^{d'}_{\mu'\nu'}|. \end{aligned} \end{equation}$

(3.9)

So we get an upper bound for the covariances and, furthermore, when the set of parameters $\mu, \nu, d$ is equal to the set $\mu', \nu', d'$ , then we automatically have upper bound for the variances of the estimators.

Following an analogous procedure, we can get another upper bound w.r.t to the dimension $W$ , i.e.,

$\begin{equation} \begin{aligned} | \mbox{cov}(\hat{\textbf{c}}_{\mu\nu}^d, \hat{\textbf{c}}_{\mu'\nu'}^{d'})| & \lt \frac{e^{\alpha_{\mu\mu'}}}{(N-1)} \frac{1- e^{\beta_{\mu\mu'} (W-1 + s)}}{(1- e^{\beta_{\mu\mu'} s})\left(\frac{W-1}{s}\right)} + \frac{1}{N} |\textbf{c}^d_{\mu\nu}\textbf{c}^{d'}_{\mu'\nu'}|. \end{aligned} \end{equation}$

(3.10)

This upper bound implies that the estimators are under control. Since $\beta_{\mu\mu'} < 0$ , this upper bound increases monotonically w.r.t $s$ and thus it decreases w.r.t $P$ . So, it makes sense for us to sample more pairs even in one image.

Let us compute the limit of the upper bound when $P$ tends to infinity. The only part of it that depends on $P$ is the kernel $\frac{1- e^{\beta_{\mu\mu'} (H-1 + s)}}{(1- e^{\beta_{\mu\mu'} s})\left(\frac{H-1}{s}\right)}$ , thus we confine the computation on it:

$\begin{equation} \begin{aligned} \frac{1- e^{\beta_{\mu\mu'} (H-1 + s)}}{(1- e^{\beta_{\mu\mu'} s})\left(\frac{H-1}{s}\right)} & = \frac{s}{H - 1} + \frac{1- e^{\beta_{\mu\mu'} (H-1)}}{\frac{e^{-\beta_{\mu\mu'} s}-1}{s}(H - 1)}\\ & \xrightarrow{s \rightarrow 0} 0 + \frac{1- e^{\beta_{\mu\mu'} (H-1)}}{-\beta_{\mu\mu'}(H-1)} \gt 0. \end{aligned} \end{equation}$

(3.11)

In , we show the behavior of the kernel w.r.t. $P$ for the image dimensions of our database and for some $\beta$ values of the same magnitude of those reported in ^[16].

Figure 3. Theoretical upper bound kernel vs $P$ .

$H$ = 1288,

$W$ = 1936,

$\beta = -0.0026$ (top),

$-0.0020$ (bottom). Right dashed line is the limit of the upper bound kernel.

DownLoad: Full-Size Img PowerPoint

We can notice an asymptotic behavior of the upper bound w.r.t. $P$ , which implies that the error introduced in the computation by limiting ourselves to a value of $P$ that can be managed by an ordinary computer is negligible.

We conclude this section with two remarks. The first one is that, if we set $(\mu, \nu, d) = (\mu', \nu', d')$ , then, by Eqs. (3.9) and (3.10), then the covariance $\mbox{cov}(\hat{\textbf{c}}_{\mu\nu}^d, \hat{\textbf{c}}_{\mu'\nu'}^{d'})$ becomes the variance $\text{var}(\hat{\textbf{c}}_{\mu\nu}^d)$ and so, when $N\to +\infty$ , the upper bounds tend to 0 and thus the variance will tend to 0 as well.

$\begin{equation} \begin{aligned} \text{var}(\hat{\textbf{c}}_{\mu\nu}^d) & \lt \frac{e^{\alpha_{\mu\mu}}}{(N-1)} \frac{1- e^{\beta_{\mu\mu} (H-1 + s)}}{(1- e^{\beta_{\mu\mu} s})\left(\frac{H-1}{s}\right)} + \frac{1}{N} (\textbf{c}^d_{\mu\nu})^2, \end{aligned} \end{equation}$

(3.12)

$\begin{equation} \begin{aligned} \text{var}(\hat{\textbf{c}}_{\mu\nu}^d) & \lt \frac{e^{\alpha_{\mu\mu}}}{(N-1)} \frac{1- e^{\beta_{\mu\mu} (W-1 + s)}}{(1- e^{\beta_{\mu\mu} s})\left(\frac{W-1}{s}\right)} + \frac{1}{N} (\textbf{c}^d_{\mu\nu})^2, \end{aligned} \end{equation}$

(3.13)

The second remark is that, the previous information plus the unbiasedness of the estimators imply that the estimators $\hat{\textbf{c}}_{\mu\nu}^d$ converge to $\textbf{c}_{\mu\nu}^d$ in $L^2$ sense, so that they are $\sqrt{N}$ -consistent. Furthermore, because of the dedicated sampling strategy, $\hat{\textbf{c}}_{\mu\nu}^d$ are asymptotically normal estimators. By the delta method, we get that $\log(\hat{\textbf{c}}_{\mu\nu}^d)$ is also a $\sqrt{N}$ -consistent estimator of $\log(\textbf{c}_{\mu\nu}^d)$ , for every strictly positive $\textbf{c}_{\mu\nu}^d$ .

3.3. Regression

We now pass to the analysis of the regression step in order to estimate the slopes $\beta_{\mu \nu}$ with the OLS technique, as previously mentioned, which can be written as follows:

$\begin{equation} \log(\hat{\textbf{c}}_{\mu\nu}^d) = \alpha_{\mu\nu} + \beta_{\mu\nu}d + \epsilon_{\mu\nu}^d, \quad \mu, \nu \in \{R, G, B \}, \end{equation}$

(3.14)

where $d$ ranges in the intermediate pixel range mentioned in section 2.6. We will denote the OLS estimators of the slopes $\beta_{\mu \nu}$ as $\hat{\beta}^{\text OLS}_{\mu \nu}$ .

We start by pointing out two problems related with the use of $\log(\hat{\textbf{c}}_{\mu\nu}^d)$ : The first one is that they are correlated, so that the noise terms $\epsilon_{\mu\nu}^d$ are correlated too. The second one is that $\log(\hat{\textbf{c}}_{\mu\nu}^d)$ is likely to be biased because of the non-linearity of the logarithmic function.

As we will now underline, these two problems will have a limited impact on our computation.

Actually, in spite of the fact that the noise terms are correlated, the OLS estimators are still unbiased, the only adverse effect of correlation is that the variance of $\hat{\beta}^{\text OLS}_{\mu \nu}$ will become larger. Formally speaking, the $\hat{\beta}^{\text OLS}_{\mu \nu}$ will not be the so-called BLUE, which stands for Best Linear Unbiased Estimators.

Passing to the second problem, even if $\log(\hat{\textbf{c}}_{\mu\nu}^d)$ is biased w.r.t $\log(\textbf{c}_{\mu\nu}^d)$ , as we previously mentioned, it remains $\sqrt{N}$ -consistent. By definition of consistency, if we observe a tiny variance of $\log(\hat{\textbf{c}}_{\mu\nu}^d)$ , then its biasedness can be ignored.

As we will show in more detail in section 5, the variance of $\log(\hat{\textbf{c}}_{\mu\nu}^d)$ that we measured in practice is almost null and so is the variance of $\hat{\beta}^{\text OLS}_{\mu \nu}$ , thus biasedness is not a problem for our computations.

4. Analysis of spatiochromatic covariance matrices commutativity

To study the commutativity of spatiochromatic covariance matrices quantitatively, we need to select a measure. Let us report here some standard definition that will be useful in this section.

Quoting ^[9], we call a set $F \subset M(n, \mathbb{R})$ of matrices a commuting family of matrices if every pair of matrices in $F$ commutes. $F$ is said to be simultaneously diagonalizable if there is a single non-singular matrix $V \in M(n, \mathbb{R})$ such that $V^{-1}AV$ is diagonal for every $A \in F$ .

From classical linear algebra, it is well known that $F\subset M(n, \mathbb{R})$ is a commuting family if and only if it is a simultaneously diagonalizable family. Moreover, for any given $A \in F$ and for any given ordering $\lambda_1, \ldots, \lambda_n$ of the eigenvalues of $A$ , there is a non-singular matrix $V \in M(n, \mathbb{R})$ such that $V^{-1}AV =$ diag $(\lambda_1, \ldots, \lambda_n)$ . Finally, if $A$ is symmetric, then $V$ is orthogonal, i.e., $V^{-1} = V^T$ , the transposed of $V$ .

These considerations allow us the possibility to measure the commutativity properties of the set of estimates $\{\hat{\textbf{c}}^d \}_d$ without having to compute all the commutators. Instead, we will compute the matrix $V$ which best simultaneously diagonalizes the family of matrix estimates and we will measure the lack of commutativity by this value:

$\begin{equation} \text{JD-obj} = \sum\limits_{d}\sum\limits_{i \neq j} \left[(V^{T}\hat{\textbf{c}}^dV)_{ij}\right]^2, \end{equation}$

(4.1)

where JD-obj stands for joint diagonalisability objective and it is the sum of the square off-diagonal elements of $V^T\hat{\textbf{c}}^dV$ , where $d$ runs from 0 to some maximal distance value used compute the covariances. Of course, in the case of perfect commutativity, JD-obj would be zero while, for an almost-commuting family, the value of JD-obj will be small but not perfectly null.

4.1. Dataset description

The database we used consists of 732 raw images, of size $1288\times 1936$ , taken by a Canon 400D. In order to explore the largest possible variety of visual content of natural scenes, we have diversified as much as possible the pictures that we have taken. In Each 4-neighborhood of pixels in a raw image contains two pixels corresponding to the $R$ and $B$ channels and two pixels corresponding to the $G$ channel. Each RAW image was demosaicked to build a subsampled sRGB image simply by keeping unaltered the $R$ and $B$ information and averaging the $G$ channel.

The advantage of raw images is that they are free from post-processing operations such as gamma correction, white balance or compression, thus, modulo camera noise, they provide a much better approximation of irradiance than other images, as e.g., jpeg ones. An excerpt of this database is provided in Figure 4.

Figure 4. Excerpt of the raw image database.

DownLoad: Full-Size Img PowerPoint

However, we found out that the proportion of images containing large areas of the sky (called sky images hereafter) dominates the semantic content of the database. Too many sky images will cluster a subset, which will have a different covariance structure than the rest. Therefore, prior to the numeric studies, we need to filter part of sky images out, to balance the database.

For this purpose, we have developed a sky classifier, that we will describe in detail in the appendix.

After filtering, there are 701 images left.

4.2. Implementation

The only hyperparameter that we need to assign beforehand is the step size $s$ . In practice, we have approximately $\min(H, W)$ options for $s$ . Since we need to decrease the variance as much as possible, considering the execution time, we chose $s = 2$ $(P = 623392)$ .

The computational complexity of Eq. (3.1) is $\mathcal{O}(NP)$ . We use Matlab 9.4 ^¶ to implement the method. In the case of our database, it takes around $14$ hours to compute 281 (pixel distances) $\times$ 9 (channel combinations) estimates for raw images, while it needs 23 to 24 hours to perform a Michaelis-Menten transformation and the same estimation.

^¶ Codes to reproduce our experiments are available at https://github.com/yiyej/spatiochromatic_cov.

5. Numeric results

In this section we present and discuss the numerical results that we have obtained through our simulations.

5.1. Validations of estimators' properties

To validate the properties of estimator $\hat{\bf{c}}_{\mu\nu}^d$ , we group $n_0$ images to mimic once realization. So we have $\frac{N}{n_0}$ realizations in total. shows the unbiasedness of $\hat{\bf{c}}^d_{\mu\nu}$ . shows the empirical upper bounds of var $(\hat{\bf{c}}^d_{\mu\nu})$ .

Figure 5. Unbiasedness. Estimates of

$\hat{\bf{c}}^0_{RR}$ from different realizations. Left:

$n_0 = 2$ (350 realizations); right:

$n_0 = 5$ (140 realizations). We can find that the estimator is unbaised.

DownLoad: Full-Size Img PowerPoint

Figure 6. Empirical upper bounds. Sample variances of

$\hat{\bf{c}}^0_{RR}$ vs

$P$ . Left:

$n_0 = 2$ (350 realizations); right:

$n_0 = 5$ (140 realizations). First, the empirical upper bound decreases dramatically as

$P$ goes up, sharing the shape with theoretical upper bound kernel. Second, the effect from

$P$ on covariances is independent of

$N$ . Third, with

$5$ images, the magnitude of sample variance has already reached

$10^{-4}$ with sufficient large

$P$ .

DownLoad: Full-Size Img PowerPoint

5.2. Exponential decay of spatio-chromatic covariance of raw images

In we show the decay of $\log(\hat{\textbf{c}}_{\mu\nu}^d)$ computed for the raw images of our database, after the sky classifier, with respect to $d$ . Since we find a linear relationship, the exponential decay holds. We built the regression model and fit it with OLS. Estimates of the slopes are provided in . Notice that, up to the accuracy $10^{-4}$ , the straight lines relative to the combinations of chromatic channels $RR$ , $RG$ $(GR)$ , $RB$ $(BR)$ are parallel and the same is true for those relatives to the combinations $GG$ and $GB$ $(BG)$ . All the straight lines are parallel up to the accuracy $10^{-3}$ .

Figure 7. Exponential decay of raw images. Parameter setting:

$s = 2 (P = 623392)$ ,

$d = 0, 1, ... 280$ . We built the regression model and fit it with OLS on the range [70, 280], where the covariances exhibit exponential decay precisely.

DownLoad: Full-Size Img PowerPoint

Table 1. Estimates of the slopes from our database of raw images after the application of the sky classifier.

$\beta_{RR}$	$\beta_{RG}\; (\beta_{GR})$	$\beta_{RB}\; (\beta_{BR})$
-0.00228	-0.00228(-0.00228)	-0.00229(-0.00229)
$\beta_{GG}$	$\beta_{GB} \; (\beta_{BG})$
-0.00222	-0.00218(-0.00218)
$\beta_{BB}$
-0.00210

| Show Table

DownLoad: CSV

5.3. Effects of the Michaelis-Menten transformation on commutativity

In this subsection, we will be focusing on the effects of the Michaelis-Menten transformation, $\textbf{u}_\mu(x) \mapsto \textbf{u}_\mu^\gamma(x)/(\textbf{u}_\mu^\gamma(x) + m_\mu^\gamma)$ on the commutativity properties of spatio-chromatic covariance matrices.

Firstly, we compute the JD-obj measure, Eq. (4.1) for the original raw images of our database after the action of the sky classifier. Then, we transform the raw images applying the Michaelis-Menten formula with 9 different $\gamma$ values, ranging from 0.2 to 1 with step 0.1 and we compute again the JD-obj measure.

It is clear that, if $\gamma \to 0$ , then the Michaelis-Menten formula will turn all the image pixels to $1/2$ , thus leaving with constant images and all spatiochromatic covariance matrices would be identical and perfectly commuting. Since we want to avoid this trivial situation, we remain far from the value $\gamma = 0$ by starting with $\gamma = 0.2$ .

In Figure 8 it can be seen that also after the Michaelis-Menten transformation the exponential decay of covariance holds true^||.

^|| It is worth mentioning that the Michaelis-Menten transformation interchanges the position of some straight lines. For example, when $\gamma = 0.7$ , the lines $RB(BR)$ and $RR$ are shifted up w.r.t $RG(GR)$ and $GG$ , respectively.

Figure 8. Exponential decay of covariance after the application of the Michaelis-Menten formula with

$\gamma = 0.7$ . Parameter setting:

$s = 2 (P = 623392)$ ,

$d = 0, 1, ... 280$ . The choice of

$\gamma = 0.7$ is arbitrary, but we stress that the exponential decay remains true also for all the other

$\gamma$ values that we have considered.

DownLoad: Full-Size Img PowerPoint

Let us now discuss the quantitative results about the commutativity measure, i.e., the JD-obj values. For the sake of clarity, let us write JD-obj $(\text{raw})$ and JD-obj $(\text{MM})$ for the JD-obj values obtained with the original raw images and the transformed ones, with the Michaelis-Menten transformation, respectively.

We have performed experiments to compute these values on the family of matrices $\{\hat{\textbf{c}}^d\}_{d = 0, \ldots, d_M}$ by changing the value of $\gamma$ .

One interesting result is that for all $d_M$ and for all $\gamma$ greater than 0.5 JD-obj $(\text{MM}) <$ JD-obj $(\text{raw})$ . This remains true also for some values of $\gamma$ smaller than 0.5 but, in this case, the relationship between JD-obj $(\text{MM})$ and JD-obj $(\text{raw})$ depends on $d_M$ .

In we show the JD-obj $(\text{MM})$ with different $\gamma$ parameters and with $d_M$ ranging from 10 to 90. The horizontal line is JD-obj $(\text{raw})$ . For all these distance values, we empirically verified that when $\gamma \geq 0.6$ , then the Michaelis-Menten transformation improves the commutativity of the family $\{ \hat{\textbf{c}}^d \}_{d = 0, \ldots, 75}$ , whilst, for $\gamma \leq 0.5$ , the Michaelis-Menten transformation may improve or not the commutativity.

Figure 9. JD-obj values. JD-obj

$(\text{MM})$ with different

$\gamma$ parameters and with

$d_M$ ranging from 10 to 90.

DownLoad: Full-Size Img PowerPoint

In we show the JD-obj $(\text{MM})$ with different $\gamma$ parameters and with $d_M$ ranging from 100 to 280. Here there is no horizontal line showing JD-obj $(\text{raw})$ because, for all these distance values and for all $\gamma$ the Michaelis-Menten transformation improves the commutativity of the family $\{ \hat{\textbf{c}}^d \}_{d = 0, \ldots, 75}$ . However, we can notice that the optimal value of $\gamma$ , corresponding the absolute minimum of the curve representing JD-obj $(\text{MM})$ , gradually shifts towards 0. One explanation for this behavior is the following: When a large number of matrices is considered, much more noise will be introduced in the computation, in this case, very small values of $\gamma$ tend to make the family of matrices more homogeneous, thus improving commutativity. Plus, if these matrices do not really commute, then the best way of forcing them to commute is to make them having more homogeneous values.

Figure 10. JD-obj values. JD-obj

$(\text{MM})$ with different

$\gamma$ parameters and with

$d_M$ ranging from 100 to 280.

DownLoad: Full-Size Img PowerPoint

We believe that, focusing on small $d_M$ will help revealing the true information about the action of the Michaelis-Menten transformation. Interestingly, 0.9 is the first value of $\gamma$ to be best, before $\gamma$ decreases back to small values.

In we indicate the best $\gamma$ value with respect to commutativity for all the distances from 1 to 280.

Figure 11. The best

$\gamma$ values for the families

$\{ \hat{\textbf{c}}^d \}_{d = 0, \ldots, d_M}$ . The

$x$ -axis records the moving of

$d_M$ .

DownLoad: Full-Size Img PowerPoint

6. Conclusions and perspectives

The work of this paper is inspired by the analysis of spatiochromatic features of natural images provided in ^[16]. Our contributions to the improvements of this paper are the following.

First of all, we constructed a collection of estimators for spatiochromatic covariance matrices that is reliable from the perspectives of unbiasedness and consistency. This construction is based on a method that permits to exploit as much as possible the information of each image, thus allowing the use of relatively small databases of images, as those typically available for raw or multispectral natural images. Our proposal is general and may be applied to reduces the amount of sample needed by any image statistics model.

Moreover, we devised a sky classifier which allowed us to remove from our database images with statistically redundant information about the sky.

We also verified with great accuracy the exponential decay of spatiochromatic covariance for raw images, showing that, up to a $10^{-3}$ accuracy, the exponential decay coefficient is the same for each combination of chromatic channels, if we avoid the very first distances which are affected by noise and errors introduced by the convolution kernel of cameras. The consequence is that, up to this accuracy, spatio-chromatic covariance matrices commute and the results of paper ^[16] about the possibility to separate the codification of spatial and chromatic visual signals into a tensor product hold true.

Finally, we have analyzed the consequences of the application of the Michaelis-Menten transformation to our raw data. If raw data can be associated to the radiance of a visual scene, their Michaelis-Menten transformed can be associated with the output of retinal cones after the absorption of light.

So, it is natural to test if the Michaelis-Menten transformation has an effect on the commutativity of spatio-chromatic covariance matrices, allowing a more precise and efficient tensor product codification of spatial and chromatic visual signals by the optical neurons.

The Michaelis-Menten transformation depends on a parameter $\gamma$ , which has been measured as $0.74$ for the retina of a rhesus monkey. Our tests have confirmed that the exponential decay is retained after the application of the Michaelis-Menten transformation and, remarkably, that, when $\gamma$ ranges between 0.6 and 1, the commutativity of spatio-chromatic covariance matrices is actually improved with respect to the original raw image values.

The results that we have obtained are very promising and confirm the conjectures of paper ^[16] about the importance of the Michaelis-Menten transformation. However, for a full proof of these assumptions a database of natural mulispectral images should be built and analyzed with the techniques described in this paper. Technological limitations of multispectral cameras do not allow this for the moment when movement (e.g., leaves moved by the wind or people walking) is considered.

Acknowledgments

Jérémie Bigot is a member of Institut Universitaire de France (IUF), and this work has been carried out with financial support from the IUF. Edoardo Provenzi acknowledges a partial support from the CNRS grant 80primes.

Conflict of interest

The authors declare no conflict of interest.

Appendix

Sky classifier

The key point of this classifier is to control the distribution mass of blue channel and red channel. After a statistical analysis of our database, we found out that, in general, the objects appearing in the pictures are characterized by high values of red, while, of course, the sky is always characterized by high values of blue.

We only consider sky in the daylight. Figure 12 shows one common distribution of daylight sky. We can see that the mass of blue channel distribution of the sky area is located in the high-valued range, so that there is an enough number of pixels capable of exhibiting `bright blue'. Also, the mass of red channel is located out of the high-valued range. Thus, most of pixels in sky images in our database are characterized by a simultaneous presence of large amount of high-valued blue pixels and a relatively small amount of high-valued red pixels.

Figure 12. One typical distribution of sky images. The histogram is plotted with 10 bins and pixels are taken from the upper 1/3 part of the corresponding image.

DownLoad: Full-Size Img PowerPoint

Inspired by the analysis above, we defined the Boolean variable that labels sky in our classifier as follows:

${\rm{label}} = ({\rm{Quantil}}{{\rm{e}}_B}({p_B}) \gt {\theta _B}){\rm{AND}}({\rm{Quantil}}{{\rm{e}}_R}({p_R}) \lt {\theta _R}),$

where, Quantile $_\mu$ ( $p_\mu$ ) is the quantile of the probability $p_\mu$ for the pixel value distribution of the chromatic channels $\mu = B$ or $R$ and $\theta_B$ should be located in the high-valued range, $\theta_R$ should be located out of high-valued range. %By requiring this, we can assure that the mass of blue channel is located in high-valued range and that the mass of red channel is located out of high-valued range.

If our database contained only horizontal images, we could limit our sky classifier only to the upper 1/3 part of an image. However, as can be seen from , we also have to deal with rotated vertical images, thus, in order to take into account both image geometries, we apply our algorithm only on the top right part of the images. More specifically, we considered the pixels belonging to the top-right part of the image, i.e., the sub-image with coordinates $(x, y)$ , $x\geq (1-\lambda) W$ and $0\leq y\leq \lambda H$ .

We then compute Quantile $_\mu$ ( $p_\mu$ ) in each sub-image and the label variable. If the label turns out to be 1, then we take the image out of our database.

Notice that pictures with a significant amount of clouds in the sky will, correctly, not be removed, because in this case there will be a significant amount of bright red pixels.

Moreover, if there is a sufficient amount of detail in the sky, as shown by Figure 13, the classifier will not remove the image from the database.

Figure 13. The sky classifier did not remove this image from the database thanks to the presence of the visible gradient which is responsible for the distributions shown in the graph at the right.

DownLoad: Full-Size Img PowerPoint

By correctly tuning the parameters of the classifier, we can control the proportion of sky images to be taken out. The tuning for our database gave this parameter selection: $\lambda = 0.4$ , $p_B$ = 0.4, $p_R$ = 0.6, $\theta_B$ = 0.6, $\theta_R$ = 0.6 which classified 31 pictures as sky images over 732, reported in Figure 14.

Figure 14. Pictures taken out of our database because labeled as sky images by our classifier.

DownLoad: Full-Size Img PowerPoint

References

[1]	Attneave F (1954) Some informational aspects of visual perception. Physchol Rev 61: 183-193.
[2]	Barlow HB (1961) Possible principles underlying the transformations of sensory messages. Sens Commun 1: 217-234.
[3]	Berman A, Plemmons RJ (1987) Nonnegative Matrices in the Mathematical Sciences, SIAM.
[4]	Buchsbaum G, Gottschalk A (1983) Trichromacy, opponent colours coding and optimum colour information transmission in the retina. P Roy Soc Lond B Bio 220: 89-113.
[5]	Field DJ (1987) Relations between the statistics of natural images and the response properties of cortical cells. J Opt Soc Am 4: 2379-2394. doi: 10.1364/JOSAA.4.002379
[6]	Frazier MW (2001) Introduction to Wavelets through Linear Algebra, Springer.
[7]	Gonzales RC, Woods RE (2002) Digital Image Processing, Prentice Hall.
[8]	Gray RM (2006) Toeplitz and circulant matrices: A review. Found Trends Commun Inform Theory 2: 155-239.
[9]	Johnson CR, Horn RA (1985) Matrix Analysis. Cambridge: Cambridge University Press.
[10]	Johnson GM, Song X, Montag ED, et al. (2010) Derivation of a color space for image color difference measurement. Color Res Appl 35: 387-400. doi: 10.1002/col.20561
[11]	MacKay DM (1956) Towards an information-flow model of human behaviour. Brit J Psy 47: 30-43. doi: 10.1111/j.2044-8295.1956.tb00559.x
[12]	Ohta Y, Kanade T, Sakai T (1980) Color information for region segmentation. Comput Graph Image Process 13: 222-241. doi: 10.1016/0146-664X(80)90047-7
[13]	Olshausen B, Field DJ (1997) Sparse coding with an overcomplete basis set: A strategy employed by v1?. Vision Res 37: 607-609.
[14]	Párraga C, Troscianko T, Tolhurst D (2002) Spatiochromatic properties of natural images and human vision. Curr Bio 6: 483-487.
[15]	Pratt WK (2007) Digital Image Processing, J. Wiley & Sons.
[16]	Provenzi E, Delon J, Gousseau Y, et al. (2016) On the second order spatiochromatic structure of natural images. Vision Res 120: 22-38. doi: 10.1016/j.visres.2015.02.025
[17]	Rao CR (1973) Linear Statistical Inference and Its Applications, John Wiley and Sons.
[18]	Ruderman DL (1996) Origin of scaling in natural images. Vision Res 37: 3385-3398.
[19]	Ruderman DL, Cronin TW, Chiao C (1998) Statistics of cone responses to natural images: Implications for visual coding. J Opt Soc Am A 15: 2036-2045. doi: 10.1364/JOSAA.15.002036
[20]	Shapley R, Enroth-Cugell C (1984) Visual adaptation and retinal gain controls. Prog Retin Res 3: 263-346. doi: 10.1016/0278-4327(84)90011-7

Reader Comments

Your name:*

Email:*
© 2020 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Mathematics in Engineering

1.4 2.2

Metrics

Article views(3146) PDF downloads(343) Cited by(0)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

Figures and Tables

Figures(14) / Tables(1)

Mathematics in Engineering

Commutativity of spatiochromatic covariance matrices in natural image statistics

Related Papers:

Abstract

1. Introduction

2. Spatiochromatic features of natural images

2.1. Chromatic redundancy in natural images

2.2. Spatiochromatic redundancy in natural images

2.3. Second order stationarity

2.4. The gray-level case

2.5. The color case

2.6. Commutativity and exponential decay of covariance matrices

3. Estimation of spatiochromatic covariances

3.1. Construction of covariance estimators

3.2. Properties of $\hat{\textbf{c}}_{\mu\nu}^d$

3.3. Regression

4. Analysis of spatiochromatic covariance matrices commutativity

4.1. Dataset description

4.2. Implementation

5. Numeric results

5.1. Validations of estimators' properties

5.2. Exponential decay of spatio-chromatic covariance of raw images

5.3. Effects of the Michaelis-Menten transformation on commutativity

6. Conclusions and perspectives

Acknowledgments

Conflict of interest

Appendix

References

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Catalog

Mathematics in Engineering

Commutativity of spatiochromatic covariance matrices in natural image statistics

Related Papers:

Abstract

1. Introduction

2. Spatiochromatic features of natural images

2.1. Chromatic redundancy in natural images

2.2. Spatiochromatic redundancy in natural images

2.3. Second order stationarity

2.4. The gray-level case

2.5. The color case

2.6. Commutativity and exponential decay of covariance matrices

3. Estimation of spatiochromatic covariances

3.1. Construction of covariance estimators

3.2. Properties of ˆcdμν \hat{\textbf{c}}_{\mu\nu}^d

3.3. Regression

4. Analysis of spatiochromatic covariance matrices commutativity

4.1. Dataset description

4.2. Implementation

5. Numeric results

5.1. Validations of estimators' properties

5.2. Exponential decay of spatio-chromatic covariance of raw images

5.3. Effects of the Michaelis-Menten transformation on commutativity

6. Conclusions and perspectives

Acknowledgments

Conflict of interest

Appendix

References

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog

3.2. Properties of $\hat{\textbf{c}}_{\mu\nu}^d$