
In this paper, we introduced a family of distributions with a very flexible shape named generalized scale mixtures of asymmetric generalized normal distributions (GSMAGN). We investigated the main properties of the new family including moments, skewness, kurtosis coefficients and order statistics. A variant of the expectation maximization (EM)-type algorithm was established by combining the profile likelihood approach (PLA) with the classical expectation conditional maximization (ECM) algorithm for parameter estimation of this model. This approach with analytical expressions in the E-step and tractable M-step can greatly improve the computational speed and efficiency of the algorithm. The performance of the proposed algorithm was assessed by some simulation studies. The feasibility of the proposed methodology was illustrated through two real datasets.
Citation: Ruijie Guan, Aidi Liu, Weihu Cheng. The generalized scale mixtures of asymmetric generalized normal distributions with application to stock data[J]. AIMS Mathematics, 2024, 9(1): 1291-1322. doi: 10.3934/math.2024064
In the wake of the swift progression of technology over recent decades, a noteworthy proliferation of diverse data types has ensued. A salient feature inherent in many of these datasets is their proclivity toward heavy tails and marked skewness, thereby rendering conventional univariate distributions inadequately applicable. Consequently, considerable scholarly focus has been directed toward the formulation of robust distributions adept at effectively modeling such intricately structured data.
Andrews and Mallows (1974) [1] and West (1987) [2] proposed the scale mixtures of normal (SMN) distributions with the following stochastic representation
Y = \mu + U^{-1/2}Z, | (1.1) |
where \mu\in\mathbb{R} is the location parameter, Z\sim N(0,\sigma^2), and U is a positive random variable independent of Z. The SMN distributions provide great flexibility in modeling symmetric, heavy-tailed datasets. Although SMN distributions are attractive, they become inadequate when the population simultaneously exhibits strong skewness and heavy-tailed behavior in practice. To cope with this problem, Branco and Dey (2001) [3] constructed the scale mixtures of skew-normal (SMSN) distributions by replacing the normal distribution in Eq (1.1) with the skew-normal distribution. This class of distributions not only contains the entire family of SMN distributions, but also accounts for asymmetry and heavy tails simultaneously. Ferreira, Lachos, and Bolfarine (2020) [4] pointed out that scale mixtures of the skew-normal distribution can offer much-needed flexibility by combining both skewness and heavy tails. Basso (2010) [5] developed an EM-type algorithm to obtain the maximum likelihood estimates for the SMSN family and derived the observed information matrix. Kim and Genton (2011) [6] obtained the characteristic functions of the SMSN family by means of a simple stochastic representation. Lin et al. contributed a great deal of important work on scale mixture models based on the skew-normal distribution, which have been found effective for heavy-tailed data involving asymmetric behaviors [7,8,9]. Mahdavi et al. [10] obtained the maximum likelihood estimates for scale-shape mixtures of flexible generalized skew-normal distributions via EM-type algorithms.
The utilization of the skew normal distribution in the aforementioned studies is rooted in the seminal work of Azzalini (1985) [11]. However, it is essential to recognize the limitations of this distribution when confronted with highly skewed and leptokurtic data, as its ability to accurately fit such data may be constrained by the range of its skewness and kurtosis coefficients. To explore more recent advancements in the field of the skew normal distribution, we recommend consulting the works of [12,13,14,15,16]. Acknowledging these limitations, it becomes imperative to develop alternative forms of skew normal distributions that can accommodate a broader spectrum of diverse data. One approach in this direction was introduced by Fernandez and Steel (1998) [17], who incorporated a skew parameter into a symmetric distribution. This innovation enabled the regulation of the probability density function on both the positive and negative orthants, offering enhanced flexibility. Another notable contribution is the work of Zhu (2009) [18], who proposed the asymmetric exponential power distribution by introducing a skew parameter into the exponential power distribution. Zhu further demonstrated the consistency, asymptotic normality, and efficiency of the maximum likelihood estimation for this distribution. These advancements signify important strides in the development of alternative skew normal distributions that can better accommodate complex data patterns.
In this work, we have introduced an asymmetric generalized normal (AGN) distribution by incorporating a skew parameter into the generalized normal (GN) distribution [19]. This extension provides a flexible framework for modeling various types of data exhibiting asymmetric behavior and heavy-tailed characteristics. However, in many practical scenarios, the observed data often exhibit strongly heavy tails and significant skewness. To address this challenge, we turn our attention to the family of generalized scale mixtures of the asymmetric generalized normal (GSMAGN) distributions. This family represents a generalization of traditional scale mixtures, wherein we replace the power -1/2 of the scale factor in Eq (1.1) with any power -1/\beta, with \beta>0. By employing this generalization, we gain greater flexibility in capturing the complexities present in the data and better accommodate the wide range of tail and skewness behaviors encountered in practical applications.
The rest of this paper is organized as follows. In section two, some definitions and essential properties of the GSMAGN family are summarized. In section three, explicit expressions for the moments of order statistics from the GSMAGN family are obtained under the independent identically distributed (IID) case. In section four, an efficient and tractable EM-type algorithm to obtain the maximum likelihood estimates of the parameters is established. In section five, the consistency properties of the estimates and the estimation of standard errors are demonstrated. In section six, a simulation study is implemented to assess the performance of the proposed algorithm. In section seven, two real dataset analyses are presented to demonstrate the feasibility of the proposed model. Finally, conclusions are noted in section eight.
Definition 2.1. Let the random variable X with the following probability density function (pdf) be
f_{AGN}(x;\mu,\sigma,\kappa,\beta) = \frac{\kappa\beta}{2^{1/\beta}(1+\kappa^2)\Gamma(1/\beta)\sigma}\exp\left\{-\frac{1}{2}\left(\frac{\kappa^{\mathrm{sign}(x-\mu)}|x-\mu|}{\sigma}\right)^{\beta}\right\},\quad x\in\mathbb{R}, | (2.1) |
where Γ(⋅) denotes the gamma function, μ∈R is a location parameter, σ>0 is a scale parameter, κ>0 is the skewness parameter and β>0 is a shape parameter. We say that the random variable X follows an AGN distribution, denoted by X∼AGN(μ,σ,κ,β). In particular, if κ=1, the AGN distribution becomes a GN distribution [19] denoted by GN(μ,σ,β).
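For concreteness, the density in Eq (2.1) can be sketched in a few lines of Python. The function name `agn_pdf` and the numerical normalization check are our own illustration, not code from the paper.

```python
import math

def agn_pdf(x, mu=0.0, sigma=1.0, kappa=1.0, beta=2.0):
    """Density of AGN(mu, sigma, kappa, beta) as in Eq (2.1)."""
    s = 1.0 if x >= mu else -1.0          # sign(x - mu), with sign(0) taken as +1
    coef = kappa * beta / (2.0 ** (1.0 / beta) * (1.0 + kappa ** 2)
                           * math.gamma(1.0 / beta) * sigma)
    return coef * math.exp(-0.5 * (kappa ** s * abs(x - mu) / sigma) ** beta)
```

With \kappa=1 and \beta=2 the coefficient reduces to 1/(\sigma\sqrt{2\pi}), so the AGN family contains the normal distribution as a special case, and for any admissible (\kappa,\beta) the density integrates to one.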
Definition 2.2. A random variable Y is in the GSMAGN family if its stochastic representation is given by
Y = \mu + \sigma[k(U)]^{1/\beta}X, | (2.2) |
where X∼AGN(0,1,κ,β), μ is the location parameter, k(⋅) is a weight function, and U is a random variable.
If U=u, then Y\mid U=u \sim AGN(\mu,[k(u)]^{1/\beta}\sigma,\kappa,\beta) and the pdf of Y is given by
f_{GSMAGN}(y;\mu,\sigma,\kappa,\alpha,\beta) = \int_{0}^{+\infty}f_{AGN}(y;\mu,[k(u)]^{1/\beta}\sigma,\kappa,\beta)\,dH(u;\alpha), | (2.3) |
where H(\cdot;\alpha) is the cumulative distribution function (CDF) of U indexed by the parameter \alpha. To endow the proposed distributions with compelling mathematical properties that facilitate statistical inference, the weight function is usually chosen as k(u)=1/u.
Proposition 2.1. If Y∼GSMAGN(μ,σ,κ,α,β), then the stochastic representation of Y can be written as follows
Y = \mu + \sigma U^{-1/\beta}I(2Z)^{1/\beta}, | (2.4) |
where U\sim H(\cdot;\alpha), Z\sim Gamma(1/\beta,1), I is a discrete random variable with probability mass function f_I = \frac{\kappa^2}{1+\kappa^2}\mathbb{1}\{I=-\kappa\}+\frac{1}{1+\kappa^2}\mathbb{1}\{I=1/\kappa\}, and U, I, Z are mutually independent.
Proposition 2.2. If Y∼GSMAGN(μ,σ,κ,α,β), then
E(Y^m) = \sum_{i=0}^{m}\binom{m}{i}\sigma^{i}\mu^{m-i}2^{i/\beta}\frac{\Gamma[(i+1)/\beta]}{\Gamma(1/\beta)}\frac{1}{1+\kappa^2}\left[\frac{1}{\kappa^{i}}+(-1)^{i}\kappa^{2+i}\right]E(U^{-i/\beta}),\quad m\in\mathbb{N}_+.
In particular, the expectation and variance of the random variable Y are respectively given by
\mu_Y = E(Y) = \mu+\sigma 2^{1/\beta}\frac{\Gamma(2/\beta)}{\Gamma(1/\beta)}\frac{1}{1+\kappa^2}\left(\frac{1}{\kappa}-\kappa^{3}\right)E(U^{-1/\beta}),\qquad Var(Y) = \sigma^{2}2^{2/\beta}\left\{\frac{1+\kappa^{6}}{\kappa^{2}(1+\kappa^2)}\frac{\Gamma(3/\beta)}{\Gamma(1/\beta)}E(U^{-2/\beta})-\frac{(1-\kappa^2)^{2}}{\kappa^{2}}\left[\frac{\Gamma(2/\beta)}{\Gamma(1/\beta)}E(U^{-1/\beta})\right]^{2}\right\}.
Proof. See Appendix Part 1.
(ⅰ) AGN distribution
In this case U=1, the CDF of Y is
F_{AGN}(y;\mu,\sigma,\kappa,\beta) = \begin{cases}\dfrac{\kappa^2}{1+\kappa^2}\left[1-\gamma\left(\dfrac{1}{\beta},\dfrac{1}{2}\left(\dfrac{\mu-y}{\kappa\sigma}\right)^{\beta}\right)\right], & \text{if } y<\mu,\\[2mm] \dfrac{\kappa^2}{1+\kappa^2}+\dfrac{1}{1+\kappa^2}\gamma\left(\dfrac{1}{\beta},\dfrac{1}{2}\left(\dfrac{\kappa(y-\mu)}{\sigma}\right)^{\beta}\right), & \text{if } y\geq\mu,\end{cases}
where \gamma(\alpha;x) = \frac{1}{\Gamma(\alpha)}\int_{0}^{x}t^{\alpha-1}e^{-t}\,dt denotes the (regularized) lower incomplete gamma function.
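The piecewise CDF above can be evaluated with a short series expansion of the regularized lower incomplete gamma function. The helper `reg_inc_gamma` below is our own minimal sketch (a library routine such as `scipy.special.gammainc` would normally be used), and `agn_cdf` follows the piecewise formula directly.

```python
import math

def reg_inc_gamma(a, x, terms=200):
    """Regularized lower incomplete gamma gamma(a; x), via the series
    gamma(a, x) = x^a e^{-x} * sum_{n>=0} x^n / (a (a+1) ... (a+n))."""
    if x <= 0.0:
        return 0.0
    total, term = 0.0, 1.0 / a
    for n in range(1, terms):
        total += term
        term *= x / (a + n)
    return total * x ** a * math.exp(-x) / math.gamma(a)

def agn_cdf(y, mu=0.0, sigma=1.0, kappa=1.0, beta=2.0):
    """CDF of AGN(mu, sigma, kappa, beta), following the piecewise formula."""
    k2 = kappa ** 2
    if y < mu:
        t = 0.5 * ((mu - y) / (kappa * sigma)) ** beta
        return k2 / (1.0 + k2) * (1.0 - reg_inc_gamma(1.0 / beta, t))
    t = 0.5 * (kappa * (y - mu) / sigma) ** beta
    return k2 / (1.0 + k2) + reg_inc_gamma(1.0 / beta, t) / (1.0 + k2)
```

For \kappa=1 and \beta=2 this reduces to the standard normal CDF, which provides a convenient sanity check.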
(ⅱ) Asymmetric generalized t distribution (AGT)
In this case, we consider U\sim Gamma(\alpha,\beta) (i.e., the parameter of H(u;\alpha) is \alpha=(\alpha,\beta)) and the pdf of Y is given by
f_{AGT}(y;\mu,\sigma,\kappa,\alpha,\beta) = \frac{2\kappa}{(2/\beta)^{1+1/\beta}(1+\kappa^2)B(\alpha,1/\beta)\sigma}\left[1+\frac{\beta}{2}\left(\frac{\kappa^{\mathrm{sign}(y-\mu)}|y-\mu|}{\sigma}\right)^{\beta}\right]^{-(\alpha+1/\beta)},\quad y\in\mathbb{R},
and the CDF of Y can be represented as
F_{AGT}(y;\mu,\sigma,\kappa,\alpha,\beta) = \begin{cases}\dfrac{\kappa^2}{1+\kappa^2}\left[1-I\left(\dfrac{1}{\beta},\alpha;h_1(y)\right)\right], & \text{if } y<\mu,\\[2mm] \dfrac{\kappa^2}{1+\kappa^2}+\dfrac{1}{1+\kappa^2}I\left(\dfrac{1}{\beta},\alpha;h_2(y)\right), & \text{if } y\geq\mu,\end{cases}
where
h_1(y) = \frac{\beta(\mu-y)^{\beta}}{2\kappa^{\beta}\sigma^{\beta}+\beta(\mu-y)^{\beta}},\qquad h_2(y) = \frac{\beta\kappa^{\beta}(y-\mu)^{\beta}}{2\sigma^{\beta}+\beta\kappa^{\beta}(y-\mu)^{\beta}}.
B(\cdot,\cdot) denotes the beta function, I(a,b;x) = \frac{1}{B(a,b)}\int_{0}^{x}t^{a-1}(1-t)^{b-1}\,dt denotes the (regularized) incomplete beta function, and \alpha,\beta>0 are two shape parameters. We then say that the random variable Y follows an AGT distribution, denoted by Y\sim AGT(\mu,\sigma,\kappa,\alpha,\beta).
Corollary 2.1. If Y∼AGT(μ,σ,κ,α,β), then
E(Y^m) = \sum_{i=0}^{m}\binom{m}{i}\sigma^{i}\mu^{m-i}(2/\beta)^{i/\beta}\frac{\Gamma(\alpha-i/\beta)\Gamma[(i+1)/\beta]}{\Gamma(\alpha)\Gamma(1/\beta)}\frac{1}{1+\kappa^2}\left[\frac{1}{\kappa^{i}}+(-1)^{i}\kappa^{2+i}\right],\quad \alpha\beta>m.
In particular, the expectation and variance of Y are respectively given by
\mu_Y = E(Y) = \mu+\sigma(2/\beta)^{1/\beta}\frac{\Gamma(\alpha-1/\beta)\Gamma(2/\beta)}{\Gamma(\alpha)\Gamma(1/\beta)}\frac{1}{1+\kappa^2}\left(\frac{1}{\kappa}-\kappa^{3}\right),\qquad Var(Y) = \sigma^{2}(2/\beta)^{2/\beta}\left\{\frac{1+\kappa^{6}}{\kappa^{2}(1+\kappa^2)}\frac{\Gamma(\alpha-2/\beta)\Gamma(3/\beta)}{\Gamma(\alpha)\Gamma(1/\beta)}-\frac{(1-\kappa^2)^{2}}{\kappa^{2}}\left[\frac{\Gamma(\alpha-1/\beta)\Gamma(2/\beta)}{\Gamma(\alpha)\Gamma(1/\beta)}\right]^{2}\right\}.
Proposition 2.3. If Y\sim AGT(\mu,\sigma,\kappa,\alpha,\beta), then U\mid Y=y \sim Gamma\left(\alpha+\frac{1}{\beta},\left[\frac{1}{\beta}+\frac{1}{2}\left(\frac{\kappa^{\mathrm{sign}(y-\mu)}|y-\mu|}{\sigma}\right)^{\beta}\right]^{-1}\right).
Proof. See Appendix Part 1.
(ⅲ) Asymmetric generalized slash distribution (AGSL)
In this case, we have U∼Beta(α,1) and the pdf of Y is given by
f_{AGSL}(y;\mu,\sigma,\kappa,\alpha,\beta) = \frac{\kappa\alpha\beta\,\Gamma(\alpha+1/\beta)}{2^{1/\beta}(1+\kappa^2)\Gamma(1/\beta)\sigma}\left[\frac{1}{2}\left(\frac{\kappa^{\mathrm{sign}(y-\mu)}|y-\mu|}{\sigma}\right)^{\beta}\right]^{-(\alpha+1/\beta)}\gamma\left[\alpha+\frac{1}{\beta};\frac{1}{2}\left(\frac{\kappa^{\mathrm{sign}(y-\mu)}|y-\mu|}{\sigma}\right)^{\beta}\right],\quad y\in\mathbb{R}.
All parameters play the same role as in AGT distribution, then we say that the random variable Y follows an AGSL distribution, denoted by Y∼AGSL(μ,σ,κ,α,β).
Proposition 2.4. If Y∼AGSL(μ,σ,κ,α,β), then the pdf of U∣Y=y is given by
f_{U\mid Y}(u\mid y) = \frac{\left[\frac{1}{2}\left(\frac{\kappa^{\mathrm{sign}(y-\mu)}|y-\mu|}{\sigma}\right)^{\beta}\right]^{\alpha+1/\beta}}{\Gamma(\alpha+1/\beta)\,\gamma\left[\alpha+\frac{1}{\beta};\frac{1}{2}\left(\frac{\kappa^{\mathrm{sign}(y-\mu)}|y-\mu|}{\sigma}\right)^{\beta}\right]}\,u^{\alpha+1/\beta-1}\exp\left[-\frac{u}{2}\left(\frac{\kappa^{\mathrm{sign}(y-\mu)}|y-\mu|}{\sigma}\right)^{\beta}\right],\quad 0<u<1.
Proof. See Appendix Part 1.
(ⅳ) Contaminated asymmetric generalized normal distribution (CAGN)
In this case, U is a discrete random variable taking one of two states with the following probability mass function
h(u;\alpha) = \alpha\mathbb{1}(u=\lambda)+(1-\alpha)\mathbb{1}(u=1),\quad 0<\alpha<1,\ 0<\lambda\leq 1,\ \text{i.e., }\alpha=(\alpha,\lambda). | (2.5) |
It follows immediately that the pdf of the random variable Y is given by
f_{CAGN}(y;\mu,\sigma,\kappa,\lambda,\alpha,\beta) = \alpha f_{AGN}(y;\mu,\lambda^{-1/\beta}\sigma,\kappa,\beta)+(1-\alpha)f_{AGN}(y;\mu,\sigma,\kappa,\beta),
and the CDF of Y is
F_{CAGN}(y;\mu,\sigma,\kappa,\lambda,\alpha,\beta) = \begin{cases}\dfrac{\kappa^2}{1+\kappa^2}-\dfrac{\kappa^2}{1+\kappa^2}\left[\alpha\gamma\left(\dfrac{1}{\beta},h_1(y)\right)+(1-\alpha)\gamma\left(\dfrac{1}{\beta},h_2(y)\right)\right], & \text{if } y<\mu,\\[2mm] \dfrac{\kappa^2}{1+\kappa^2}+\dfrac{1}{1+\kappa^2}\left[\alpha\gamma\left(\dfrac{1}{\beta},h_1(y)\right)+(1-\alpha)\gamma\left(\dfrac{1}{\beta},h_2(y)\right)\right], & \text{if } y\geq\mu,\end{cases}
where
h_1(y) = \frac{\lambda}{2}\left(\frac{\kappa^{\mathrm{sign}(y-\mu)}|y-\mu|}{\sigma}\right)^{\beta},\qquad h_2(y) = \frac{1}{2}\left(\frac{\kappa^{\mathrm{sign}(y-\mu)}|y-\mu|}{\sigma}\right)^{\beta}.
Proposition 2.5. If Y∼CAGN(μ,σ,κ,λ,α,β), the pdf of U∣Y=y for 0<λ<1 is given by
P(U=\lambda\mid Y=y) = \frac{\alpha f_{AGN}(y;\mu,\lambda^{-1/\beta}\sigma,\kappa,\beta)}{f_{CAGN}(y;\mu,\sigma,\kappa,\lambda,\alpha,\beta)},\qquad P(U=1\mid Y=y) = \frac{(1-\alpha)f_{AGN}(y;\mu,\sigma,\kappa,\beta)}{f_{CAGN}(y;\mu,\sigma,\kappa,\lambda,\alpha,\beta)}.
Proof. See Appendix Part 1.
Figure 1 illustrates the pdf of the GSMAGN family under three distinct scenarios, providing valuable insights into the impact of the shape parameters (α and β) on the pdf's shape. The chosen values of α and β exert a decisive influence on the characteristics of the density. When α and β assume smaller values (Figure 1a and 1c), the resulting densities exhibit longer tails and greater leptokurtosis. Conversely, larger values of α and β (Figure 1b) lead to contrasting effects. Notably, the shape parameter β has a predominant impact on the distribution's shape, while the parameter α governs the behavior of the tail. Moreover, all three subfigures demonstrate that the AGSL distribution possesses thicker tails compared to the other three distributions. Consequently, the AGSL distribution exhibits a favorable advantage in modeling observations characterized by skewness and heavy tails.
In the following, we provide the moments of order statistics generated from the GSMAGN(0,1,\kappa,\alpha,\beta) distribution in the IID case.
Proposition 3.1. Let X_1,X_2,\cdots,X_n be a random sample of size n from AGN(0,1,\kappa,\beta), and let X_{1:n},X_{2:n},\cdots,X_{n:n} be the corresponding order statistics. The m-th moment of the largest order statistic is given by
E(X_{n:n}^{m}) = n\sum_{i=0}^{n-1}\sum_{j=0}^{\infty}\binom{n-1}{i}\frac{2^{m/\beta}}{(1+\kappa^2)^{n}[\Gamma(1/\beta)]^{i+1}}\left[(-1)^{i+m}\kappa^{2n+m}+\kappa^{2(n-i-1)-m}\right]b_j^{(i)}\Gamma\left(\frac{m+i+1}{\beta}+j\right),
where
b_j = \frac{(-1)^{j}}{j!\,(1/\beta+j)},\quad b_j^{(s)} = \sum_{h=0}^{j}b_h^{(s-1)}b_{j-h},\quad b_j^{(1)}=b_j,\quad b_j^{(0)}=\begin{cases}1, & j=0,\\ 0, & j>0,\end{cases}\qquad j=0,1,\cdots.
Proof. See Appendix Part 2.
The Lauricella function of type A (see, for example, Exton(1978)[20]) is defined as
F_A^{(n)}(a,b_1,\cdots,b_n;c_1,\cdots,c_n;x_1,\cdots,x_n) = \sum_{m_1=0}^{\infty}\cdots\sum_{m_n=0}^{\infty}\frac{(a)_{m_1+\cdots+m_n}(b_1)_{m_1}\cdots(b_n)_{m_n}}{(c_1)_{m_1}\cdots(c_n)_{m_n}}\frac{x_1^{m_1}\cdots x_n^{m_n}}{m_1!\cdots m_n!},
where (f)k=f(f+1)⋯(f+k−1) denotes the ascending factorial.
We also acquire the moments of the largest order statistic by means of the Lauricella function of type A.
Proposition 3.2. If X∼AGN(0,1,κ,β), and X1:n,X2:n,⋯,Xn:n are the order statistics, then the mth moments of the largest order statistic is given by
E(X_{n:n}^{m}) = n\sum_{i=0}^{n-1}\binom{n-1}{i}\frac{\beta^{i}\,2^{m/\beta}\,\Gamma\left(\frac{m+i+1}{\beta}\right)}{(1+\kappa^2)^{n}[\Gamma(1/\beta)]^{i+1}}\left[(-1)^{i+m}\kappa^{2n+m}+\kappa^{2(n-i-1)-m}\right]\times F_A^{(i)}\left(\frac{m+i+1}{\beta},\frac{1}{\beta},\cdots,\frac{1}{\beta};\frac{1}{\beta}+1,\cdots,\frac{1}{\beta}+1;-1,\cdots,-1\right).
Proof. See Appendix Part 2.
Proposition 3.3. If X\sim AGT(0,1,\kappa,\alpha,\beta), 1/\beta is an integer, and X_{1:n},X_{2:n},\cdots,X_{n:n} are the order statistics, then the m-th moment of the largest order statistic X_{n:n} is given by
E(X_{n:n}^{m}) = \frac{n(2/\beta)^{m/\beta}}{(1+\kappa^2)^{n}}\left[\frac{(-1)^{m}\kappa^{2n+m}}{[B(\alpha,1/\beta)]^{n}}\sum_{v=0}^{(1/\beta-1)(n-1)}T_v(1/\beta,n-1)B\left(\frac{m+1}{\beta},\alpha n+v-\frac{m}{\beta}\right)+\sum_{j=0}^{n-1}(-1)^{j}\binom{n-1}{j}\frac{1}{\kappa^{m}[B(\alpha,1/\beta)]^{j}}\sum_{v=0}^{(1/\beta-1)j}T_v(1/\beta,j)B\left(\frac{m+1}{\beta},\alpha(j+1)+v-\frac{m}{\beta}\right)\right],
where T_v(1/\beta,0)=1, T_v(1/\beta,1)=C_v for v=0,1,\cdots,(1/\beta-1), and C_v=(-1)^{v}\binom{1/\beta-1}{v}\frac{1}{\alpha+v}. For n>2, T_v\left(\frac{1}{\beta},n-1\right)=\sum_{i=0}^{v}C_i T_{v-i}\left(\frac{1}{\beta},n-2\right), v=0,1,\cdots,(1/\beta-1)(n-1).
Proof. See Appendix Part 2.
The formula for computing the m-th moment of the order statistics given in Proposition 3.3 is only applicable to the case where 1/\beta is an integer. We now derive a general representation of the m-th moment of order statistics without restrictions on \beta.
Introduced by Exton (1978) [20], the generalized Kampe de Feriet function is defined by
F_{C:D}^{A:B}\left((a):(b_1);\cdots;(b_n);(c):(d_1);\cdots;(d_n);x_1,\cdots,x_n\right) = \sum_{m_1=0}^{\infty}\cdots\sum_{m_n=0}^{\infty}\frac{((a))_{m_1+\cdots+m_n}((b_1))_{m_1}\cdots((b_n))_{m_n}}{((c))_{m_1+\cdots+m_n}((d_1))_{m_1}\cdots((d_n))_{m_n}}\times\frac{x_1^{m_1}\cdots x_n^{m_n}}{m_1!\cdots m_n!},
where a=(a_1,a_2,\cdots,a_A), b_i=(b_{i,1},b_{i,2},\cdots,b_{i,B}), c=(c_1,\cdots,c_C), d_i=(d_{i,1},\cdots,d_{i,D}), for i=1,2,\cdots,n, and ((f))_k=((f_1,f_2,\cdots,f_p))_k=(f_1)_k\cdots(f_p)_k, (f_i)_k=f_i(f_i+1)\cdots(f_i+k-1).
By using the generalized Kampe de Feriet function, we derive the following proposition:
Proposition 3.4. If 1/\beta is a real non-integer, the m-th moment of the order statistic X_{n:n} from AGT(0,1,\kappa,\alpha,\beta) can be computed by the following convergent expression
E(X_{n:n}^{m}) = nJ(m,n-1,\kappa)+\sum_{j=0}^{n-1}(-1)^{m+j}\binom{n-1}{j}nJ(m,j+1,1/\kappa),
where
J(m,n-1,\kappa) = \frac{(-1)^{m}\kappa^{2n+m}(2/\beta)^{m/\beta}}{(1+\kappa^2)^{n}[B(\alpha,1/\beta)]^{n}}B\left(\frac{m+1}{\beta},\alpha n-\frac{m}{\beta}\right)F_{1:1}^{1:2}\left(\left(\alpha n-\frac{m}{\beta}\right):\left(1-\frac{1}{\beta},\alpha\right);\cdots;\left(1-\frac{1}{\beta},\alpha\right);\left(\alpha n+\frac{1}{\beta}\right):(\alpha+1);\cdots;(\alpha+1);1,\cdots,1\right).
Proof. See Appendix Part 2.
The relationship between order statistics from CAGN distribution and AGN distribution can be stated as follows
Proposition 3.5. If Y\sim CAGN(0,1,\kappa,\lambda,\alpha,\beta) and X\sim AGN(0,1,\kappa,\beta), then we have
E(Y_{r:n}) = P(U=1)E(X_{r:n})+P(U=\lambda)\lambda^{-1/\beta}E(X_{r:n}) = \left[(1-\alpha)+\alpha\lambda^{-1/\beta}\right]E(X_{r:n}).
Proof. See Appendix Part 2.
In this section, an EM-type algorithm to determine the MLEs for the parameters of the GSMAGN(μ,σ,κ,α,β) family is established.
Originally proposed by Dempster et al. (1977) [21], the EM algorithm has garnered significant attention as a powerful tool for handling estimation challenges in situations involving incomplete data. This algorithm offers numerous exceptional advantages. For a more comprehensive understanding of this subject, we highly recommend referring to the monograph by Lange (2013) [22]. A notable extension of the EM algorithm is the ECM algorithm introduced by Meng and Rubin (1993) [23]. The ECM algorithm not only preserves the desirable property of monotonic convergence exhibited by the Q-function in the classical EM algorithm but also facilitates faster convergence rates.
Invoking Proposition 2.1, the hierarchical representation for the GSMAGN model can be described as
Y_j\mid U_j=u_j \sim AGN(\mu,u_j^{-1/\beta}\sigma,\kappa,\beta),\qquad U_j\sim H(u_j;\alpha).
Let Y=(y_1,\ldots,y_n)^{T} denote the observed data and U=(u_1,\ldots,u_n)^{T} the corresponding latent variables. Combining Y and U, we obtain the complete data, denoted by W=(Y,U). Let \Theta=(\mu,\sigma,\kappa,\alpha)^{T} represent the vector of parameters; then the complete log-likelihood function can be expressed as follows
\ln L(\Theta\mid W) = n\left[\ln\kappa-\ln(1+\kappa^2)+\ln\beta-(1/\beta)\ln 2-\ln\Gamma(1/\beta)-\ln\sigma\right]+\frac{1}{\beta}\sum_{j=1}^{n}\ln u_j-\frac{1}{2}\sum_{j=1}^{n}\left(\frac{\kappa^{\mathrm{sign}(y_j-\mu)}|y_j-\mu|}{\sigma}\right)^{\beta}u_j+\sum_{j=1}^{n}\ln h(u_j;\alpha). | (4.1) |
Prior to outlining the precise steps of the estimation algorithm, it is imperative to highlight a crucial aspect. We adopt the assumption that the shape parameter \beta remains fixed, and the optimal value for this parameter is determined through the PLA. By integrating the ECM algorithm with the PLA, the computational speed and overall efficiency of the algorithm are significantly enhanced. Consequently, under this assumption, the parameter vector reduces to \Theta=(\mu,\sigma,\kappa,\alpha)^{T} throughout the subsequent sections of this paper.
It is well known that the EM framework is an iterative algorithm consisting of two steps, namely, the expectation step (E-step) and the maximization step (M-step).
E-step: Given the observed data Y and the parameter estimates \hat{\Theta}^{(h)} from the h-th iteration, this step involves the calculation of several necessary conditional expectations. The so-called Q-function is expressed by
Q(\Theta\mid\hat{\Theta}^{(h)}) = E\left[\ln L(\Theta\mid W)\mid Y,\hat{\Theta}^{(h)}\right] = n\left[\ln\kappa-\ln(1+\kappa^2)+\ln\beta-(1/\beta)\ln 2-\ln\Gamma(1/\beta)-\ln\sigma\right]+\frac{1}{\beta}\sum_{j=1}^{n}A_{2j}(Y,\hat{\Theta}^{(h)})-\frac{1}{2}\sum_{j=1}^{n}\left(\frac{\kappa^{\mathrm{sign}(y_j-\mu)}|y_j-\mu|}{\sigma}\right)^{\beta}A_{1j}(Y,\hat{\Theta}^{(h)})+\sum_{j=1}^{n}B_j(Y,\hat{\Theta}^{(h)}), | (4.2) |
where A_{1j}(Y,\hat{\Theta}^{(h)}) = E[U_j\mid Y,\hat{\Theta}^{(h)}], A_{2j}(Y,\hat{\Theta}^{(h)}) = E[\ln U_j\mid Y,\hat{\Theta}^{(h)}], and B_j(Y,\hat{\Theta}^{(h)}) = E[\ln h(U_j;\alpha)\mid Y,\hat{\Theta}^{(h)}].
For different members of the GSMAGN family, the crucial conditional expectations are calculated as follows
● For AGT
A_{1j}(Y,\hat{\Theta}^{(h)}) = \frac{\hat{\alpha}^{(h)}+1/\beta}{1/\beta+\hat{c}_j^{(h)}},\quad A_{2j}(Y,\hat{\Theta}^{(h)}) = \varphi(\hat{\alpha}^{(h)}+1/\beta)-\ln(1/\beta+\hat{c}_j^{(h)}),\quad B_j(Y,\hat{\Theta}^{(h)}) = -[\alpha\ln\beta+\ln\Gamma(\alpha)]-\frac{1}{\beta}A_{1j}(Y,\hat{\Theta}^{(h)})+(\alpha-1)A_{2j}(Y,\hat{\Theta}^{(h)}),
where \hat{c}_j^{(h)} = \frac{1}{2}\left(\frac{[\hat{\kappa}^{(h)}]^{\mathrm{sign}(y_j-\hat{\mu}^{(h)})}|y_j-\hat{\mu}^{(h)}|}{\hat{\sigma}^{(h)}}\right)^{\beta}, and \varphi(\cdot) is the digamma function.
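As an illustration, the AGT conditional expectations above are cheap to evaluate in closed form. The digamma routine below is a hand-rolled sketch (in practice one would call `scipy.special.digamma`), and the function names are our own.

```python
import math

def digamma(x):
    """Digamma function via the recurrence psi(x) = psi(x+1) - 1/x
    and an asymptotic expansion for large arguments."""
    r = 0.0
    while x < 8.0:
        r -= 1.0 / x
        x += 1.0
    f = 1.0 / (x * x)
    return r + math.log(x) - 0.5 / x - f * (1.0 / 12 - f * (1.0 / 120 - f / 252))

def agt_e_step(y, mu, sigma, kappa, alpha, beta):
    """E-step quantities A1j and A2j for one observation under the AGT model."""
    s = 1.0 if y >= mu else -1.0
    c = 0.5 * (kappa ** s * abs(y - mu) / sigma) ** beta          # \hat c_j
    a1 = (alpha + 1.0 / beta) / (1.0 / beta + c)                  # E[U | y]
    a2 = digamma(alpha + 1.0 / beta) - math.log(1.0 / beta + c)   # E[ln U | y]
    return a1, a2
```

At y=\mu we have \hat{c}_j=0, so A_{1j}=(\hat{\alpha}+1/\beta)\beta, which gives a quick sanity check on the implementation.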
● For AGSL
A_{1j}(Y,\hat{\Theta}^{(h)}) = \frac{(\hat{\alpha}^{(h)}+1/\beta)\,\gamma(\hat{\alpha}^{(h)}+1/\beta+1;\hat{c}_j^{(h)})}{\hat{c}_j^{(h)}\,\gamma(\hat{\alpha}^{(h)}+1/\beta;\hat{c}_j^{(h)})},\quad A_{2j}(Y,\hat{\Theta}^{(h)}) = \frac{\int_0^{\hat{c}_j^{(h)}}t^{\hat{\alpha}^{(h)}+1/\beta-1}e^{-t}\ln t\,dt}{\Gamma(\hat{\alpha}^{(h)}+1/\beta)\,\gamma(\hat{\alpha}^{(h)}+1/\beta;\hat{c}_j^{(h)})}-\ln\hat{c}_j^{(h)},\quad B_j(Y,\hat{\Theta}^{(h)}) = \ln\alpha+(\alpha-1)A_{2j}(Y,\hat{\Theta}^{(h)}).
● For CAGN
A_{1j}(Y,\hat{\Theta}^{(h)}) = \frac{(1-\hat{\alpha}^{(h)})+\hat{\alpha}^{(h)}[\hat{\lambda}^{(h)}]^{1+1/\beta}\exp\left((1-\hat{\lambda}^{(h)})\hat{c}_j^{(h)}\right)}{(1-\hat{\alpha}^{(h)})+\hat{\alpha}^{(h)}[\hat{\lambda}^{(h)}]^{1/\beta}\exp\left((1-\hat{\lambda}^{(h)})\hat{c}_j^{(h)}\right)},\quad A_{2j}(Y,\hat{\Theta}^{(h)}) = \frac{\hat{\alpha}^{(h)}[\hat{\lambda}^{(h)}]^{1/\beta}\ln\hat{\lambda}^{(h)}}{\hat{\alpha}^{(h)}[\hat{\lambda}^{(h)}]^{1/\beta}+(1-\hat{\alpha}^{(h)})\exp\left((\hat{\lambda}^{(h)}-1)\hat{c}_j^{(h)}\right)},\quad B_j(Y,\hat{\Theta}^{(h)}) = \ln\alpha\cdot\frac{\hat{\alpha}^{(h)}f_{AGN}(y_j;\hat{\mu}^{(h)},[\hat{\lambda}^{(h)}]^{-1/\beta}\hat{\sigma}^{(h)},\hat{\kappa}^{(h)},\beta)}{f_{CAGN}(y_j;\hat{\mu}^{(h)},\hat{\sigma}^{(h)},\hat{\kappa}^{(h)},\hat{\lambda}^{(h)},\hat{\alpha}^{(h)},\beta)}+\ln(1-\alpha)\cdot\frac{(1-\hat{\alpha}^{(h)})f_{AGN}(y_j;\hat{\mu}^{(h)},\hat{\sigma}^{(h)},\hat{\kappa}^{(h)},\beta)}{f_{CAGN}(y_j;\hat{\mu}^{(h)},\hat{\sigma}^{(h)},\hat{\kappa}^{(h)},\hat{\lambda}^{(h)},\hat{\alpha}^{(h)},\beta)}.
In the EM algorithm, the M-step maximizes the conditional expectation obtained in the E-step, and the suggested framework can be briefly described as follows.
(1) According to the theory of robust statistics [24], ˆμ(h+1) can be updated through the following expression
\hat{\mu}^{(h+1)} = \frac{\sum_{j=1}^{n}\omega_j(Y,\hat{\Theta}^{(h)})y_j}{\sum_{l=1}^{n}\omega_l(Y,\hat{\Theta}^{(h)})}, | (4.3) |
where \omega_j(Y,\Theta) = \kappa^{\beta\,\mathrm{sign}(y_j-\mu)}|y_j-\mu|^{\beta-2}A_{1j}(Y,\Theta).
(2) Update ˆσ(h+1) by the following equation
\hat{\sigma}^{(h+1)} = \left(\frac{\beta}{2n}\right)^{1/\beta}\left[\sum_{j=1}^{n}\omega_j(Y,\hat{\Theta}^{*(h)})|y_j-\hat{\mu}^{(h+1)}|^{2}\right]^{1/\beta}, | (4.4) |
where \hat{\Theta}^{*(h)} = (\hat{\mu}^{(h+1)},\hat{\sigma}^{(h)},\hat{\kappa}^{(h)},\hat{\alpha}^{(h)}).
(3) Update ˆκ(h+1) by the following equation
\hat{\kappa}^{(h+1)} = \left\{\frac{\sum_{j=1}^{n}([y_j-\hat{\mu}^{(h+1)}]^{-})^{\beta}A_{1j}(Y,\hat{\Theta}^{**(h)})}{\sum_{j=1}^{n}([y_j-\hat{\mu}^{(h+1)}]^{+})^{\beta}A_{1j}(Y,\hat{\Theta}^{**(h)})}\right\}^{\frac{1}{2(\beta+1)}}, | (4.5) |
where \hat{\Theta}^{**(h)} = (\hat{\mu}^{(h+1)},\hat{\sigma}^{(h+1)},\hat{\kappa}^{(h)},\hat{\alpha}^{(h)}), [x]^{+}=\max\{x,0\}, and [x]^{-}=[-x]^{+}.
(4) Regarding ˆα(h+1)
● For AGT: Update ˆα(h+1) as the root of
[\varphi(\alpha)+\ln\beta]-\frac{1}{n}\sum_{j=1}^{n}A_{2j}(Y,\hat{\Theta}^{***(h)}) = 0, | (4.6) |
where \hat{\Theta}^{***(h)} = (\hat{\mu}^{(h+1)},\hat{\sigma}^{(h+1)},\hat{\kappa}^{(h+1)},\hat{\alpha}^{(h)}).
● For AGSL: Update ˆα(h+1) by
\hat{\alpha}^{(h+1)} = \frac{-n}{\sum_{j=1}^{n}A_{2j}(Y,\hat{\Theta}^{***(h)})}. | (4.7) |
● For CAGN: Update ˆα(h+1) by
\hat{\alpha}^{(h+1)} = \frac{1}{n}\sum_{j=1}^{n}\frac{\hat{\alpha}^{(h)}[\hat{\lambda}^{(h)}]^{1/\beta}}{\hat{\alpha}^{(h)}[\hat{\lambda}^{(h)}]^{1/\beta}+(1-\hat{\alpha}^{(h)})\exp\left((\hat{\lambda}^{(h)}-1)\hat{c}_j^{(h)}\right)}. | (4.8) |
Remark 4.1. For AGN, the following iterative (Newton-Raphson) procedure to update \hat{\beta}^{(h+1)} is also recommended:
\hat{\beta}^{(h+1)} = \hat{\beta}^{(h)}-\frac{\partial Q(\Theta\mid\hat{\Theta}^{***(h)})}{\partial\beta}\left(\frac{\partial^{2}Q(\Theta\mid\hat{\Theta}^{***(h)})}{\partial\beta^{2}}\right)^{-1}, | (4.9) |
where
\frac{\partial Q(\Theta\mid\hat{\Theta}^{***(h)})}{\partial\beta} = n\left[\frac{1}{\beta}+\frac{\ln 2}{\beta^{2}}+\frac{\varphi(1/\beta)}{\beta^{2}}\right]-\sum_{j=1}^{n}\hat{c}_j^{(h)}\left(\ln 2+\frac{1}{\beta}\ln\hat{c}_j^{(h)}\right),\qquad \frac{\partial^{2}Q(\Theta\mid\hat{\Theta}^{***(h)})}{\partial\beta^{2}} = -n\left[\frac{1}{\beta}+\frac{2\ln 2}{\beta^{2}}+\frac{2\varphi(1/\beta)}{\beta^{3}}+\frac{2\varphi'(1/\beta)}{\beta^{4}}\right]-\sum_{j=1}^{n}\hat{c}_j^{(h)}\left(\ln 2+\frac{2}{\beta}\ln\hat{c}_j^{(h)}\right).
For a prescribed value \varepsilon>0, if the values of the actual log-likelihood function L(Y\mid\Theta) at two successive iterations satisfy |L(Y\mid\hat{\Theta}^{(h+1)})/L(Y\mid\hat{\Theta}^{(h)})-1|<\varepsilon, then the iterative process is terminated. In our study, the tolerance \varepsilon is set to 10^{-5}.
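The overall iteration, including the relative-likelihood stopping rule above, can be sketched generically. Here `e_step`, `cm_steps`, and `loglik` are placeholders for the model-specific routines described in this section, not functions defined in the paper.

```python
def ecm(y, theta0, e_step, cm_steps, loglik, tol=1e-5, max_iter=500):
    """Generic ECM skeleton: alternate the E-step and the conditional
    M-steps until the relative change in the log-likelihood is below tol."""
    theta = theta0
    ll = loglik(y, theta)
    for _ in range(max_iter):
        expectations = e_step(y, theta)          # A1j, A2j, Bj, ...
        theta = cm_steps(y, theta, expectations) # Eqs (4.3)-(4.8) in turn
        ll_new = loglik(y, theta)
        if abs(ll_new / ll - 1.0) < tol:         # stopping rule with eps = tol
            break
        ll = ll_new
    return theta
```

The same skeleton works for any member of the family; only the E-step expectations and the CM updates change.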
It is well known that the starting values play a decisive role in the implementation of an EM-type algorithm. As a result, the following steps are recommended to find reasonable starting values.
Notes on implementation
● Random sample generation: From the stochastic representation in Proposition 2.1, the following procedures are recommended to generate random samples from GSMAGN.
Step 1: Generate U\sim H(u;\alpha) and Z\sim Gamma(1/\beta,1).
Step 2: Generate W\sim U(0,1). If W\leq\frac{1}{1+\kappa^2}, set I=1/\kappa; otherwise set I=-\kappa.
Step 3: Set Y=\mu+\sigma U^{-1/\beta}I(2Z)^{1/\beta} and return Y.
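Steps 1-3 can be implemented directly. The sketch below draws from the AGT member by taking U\sim Gamma(\alpha, scale \beta), which is one concrete choice of H(u;\alpha); the function name is our own.

```python
import random

def rgsmagn_agt(n, mu, sigma, kappa, alpha, beta, rng=None):
    """Generate n draws from the AGT member of the GSMAGN family via the
    stochastic representation Y = mu + sigma * U^{-1/beta} I (2Z)^{1/beta}."""
    rng = rng or random.Random()
    out = []
    for _ in range(n):
        u = rng.gammavariate(alpha, beta)        # Step 1: U ~ Gamma(alpha, scale beta)
        z = rng.gammavariate(1.0 / beta, 1.0)    #         Z ~ Gamma(1/beta, 1)
        w = rng.random()                          # Step 2: pick the sign component I
        i = 1.0 / kappa if w <= 1.0 / (1.0 + kappa ** 2) else -kappa
        out.append(mu + sigma * u ** (-1.0 / beta) * i * (2.0 * z) ** (1.0 / beta))
    return out
```

With \kappa=1 the resulting law is symmetric about \mu, so the sample mean of a large draw should be close to \mu.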
● Initialization: The initial values for \hat{\mu}^{(0)}, \hat{\sigma}^{(0)} and \hat{\kappa}^{(0)} can be computed by
\hat{\mu}^{(0)} = \mathrm{mode}(Y),\quad \hat{\sigma}^{(0)} = \sqrt{\frac{\sum_{j=1}^{n}(y_j-\hat{\mu}^{(0)})^{2}}{n}},\quad \hat{\kappa}^{(0)} = \sqrt{\frac{\sum_{j=1}^{n}\mathbb{1}(y_j<\hat{\mu}^{(0)})}{\sum_{j=1}^{n}\mathbb{1}(y_j>\hat{\mu}^{(0)})}}.
(1) For AGT and AGSL: Compute the initial guess ˆα(0) through the following equation
\hat{\alpha}^{(0)} = \arg\max_{\alpha}\sum_{j=1}^{n}\ln f(y_j;\hat{\mu}^{(0)},\hat{\sigma}^{(0)},\hat{\kappa}^{(0)},\alpha,\beta).
(2) For CAGN: The initial values for \alpha and \lambda are generated uniformly in (0,1).
Choosing the value of β
Step 1: Run the AGN model to obtain a reference estimate \tilde{\beta}, then consider a grid of values \{\check{\beta}^{(1)},\ldots,\check{\beta}^{(R)}\} containing \tilde{\beta} (the initial value for \beta can be obtained from the moment estimator).
Step 2: Substitute each \check{\beta}^{(q)} into the Q-function to obtain simultaneous estimates of the unknown parameters, denoted by \hat{\theta}^{(q)}, via the ECM algorithm. Then calculate the value of the corresponding log-likelihood function through Eq (4.1).
Step 3: Repeat Step 2, interpolate over the \beta values, and take \breve{\beta} as the value that maximizes the profile log-likelihood.
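The grid search in Steps 1-3 amounts to maximizing the profile log-likelihood over \beta. The sketch below assumes user-supplied `fit_given_beta` (an ECM run with \beta held fixed) and `profile_loglik` routines; both names are ours.

```python
def choose_beta(y, beta_grid, fit_given_beta, profile_loglik):
    """Profile likelihood approach: fit the remaining parameters for each
    fixed beta on the grid and keep the beta with the largest log-likelihood."""
    best_beta, best_ll = None, float("-inf")
    for b in beta_grid:
        theta_b = fit_given_beta(y, b)        # ECM with beta held fixed
        ll = profile_loglik(y, theta_b, b)
        if ll > best_ll:
            best_beta, best_ll = b, ll
    return best_beta
```

A finer grid (or interpolation of the profile curve, as in Step 3) refines the selected value.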
Lemma 5.1. Let X=(X_1,\cdots,X_n) be n independent and identically distributed random variables with density f(x;\theta), where \theta=(\theta_1,\cdots,\theta_s) belongs to the parameter space \Theta, an open set in \mathbb{R}^{s}. Assume that
(1) If \theta_1\neq\theta_2, then P_{\theta_1}\neq P_{\theta_2}.
(2) For all x in the range of X, f(x;\theta)>0 and f has a continuous first partial derivative with respect to \theta.
(3) For every \theta_0\in\Theta, there exist a neighborhood U_{\theta_0}\subset\Theta and a positive function G(x) such that for \theta,\theta'\in U_{\theta_0}, |\ln f(x;\theta')-\ln f(x;\theta)|\leq G(x)\|\theta'-\theta\|, and E_{\theta_0}[G(X)]<\infty.
Then for \theta_0\in\Theta, as n\to\infty, there exists (a.s.) a solution \hat{\theta} to the likelihood equation
\frac{\partial\ln L(x;\theta)}{\partial\theta} = \sum_{i=1}^{n}\frac{\partial\ln f(x_i;\theta)}{\partial\theta} = 0,
and the solution \hat{\theta} is strongly consistent.
Theorem 5.1. If Y\sim GSMAGN(\mu,\sigma,\kappa,\alpha,\beta) and y=(y_1,y_2,\ldots,y_n) is a random sample from the population Y, where \theta\in\tilde{\Theta}=\{(\mu,\sigma,\kappa,\alpha,\beta):\mu\in\mathbb{R},\sigma\in\mathbb{R}^{+},\kappa\in\mathbb{R}^{+},\alpha\in\mathbb{R}^{+},\beta\in\mathbb{R}^{+}\}, then the maximum likelihood estimator \hat{\theta} of \theta satisfies
Pθ(limn→∞ˆθ=θ)=1,∀θ∈˜Θ. |
Proof. See Appendix Part 3.
The observed information matrix of the GSMAGN distribution is computed by
I_0(\Theta\mid y) = -\partial^{2}L(\Theta\mid y)/\partial\Theta\partial\Theta^{T}.
It is well known that, under some regularity conditions, the covariance matrix of the maximum likelihood estimators \hat{\Theta} can be approximated by the inverse of I_0(\Theta\mid y). We evaluate
I_0(\Theta\mid y) = \sum_{i=1}^{n}\hat{s}_i\hat{s}_i^{T}. | (5.1) |
Following the results of Basso [5], the individual score can be described as \hat{s}_i = \partial L(\Theta\mid y_i)/\partial\Theta|_{\Theta=\hat{\Theta}}, where \hat{s}_i=(\hat{s}_{i,\mu},\hat{s}_{i,\sigma},\hat{s}_{i,\kappa},\hat{s}_{i,\alpha}). The elements of \hat{s}_i are given by
\hat{s}_{i,\mu} = \frac{D_{\mu}(f(y_i;\Theta))}{f(y_i;\Theta)},\quad \hat{s}_{i,\sigma} = \frac{D_{\sigma}(f(y_i;\Theta))}{f(y_i;\Theta)},\quad \hat{s}_{i,\kappa} = \frac{D_{\kappa}(f(y_i;\Theta))}{f(y_i;\Theta)},\quad \hat{s}_{i,\alpha} = \frac{D_{\alpha}(f(y_i;\Theta))}{f(y_i;\Theta)}.
D_{\mu}(f(y_i;\Theta)) denotes the derivative of the density function with respect to \mu, and D_{\sigma}(f(y_i;\Theta)), D_{\kappa}(f(y_i;\Theta)) and D_{\alpha}(f(y_i;\Theta)) are defined by analogy. Let
M_{i,k} = \int_{0}^{\infty}u^{1/\beta+k}\exp\left\{-\frac{1}{2}\left(\frac{\kappa^{\mathrm{sign}(y_i-\mu)}|y_i-\mu|}{\sigma}\right)^{\beta}u\right\}dH(u;\alpha).
After some algebraic computation, letting A_i = \frac{\kappa^{\mathrm{sign}(y_i-\mu)}|y_i-\mu|}{\sigma}, we obtain
s_{i,\mu} = \frac{\beta}{2\sigma}\mathrm{sign}(\mu-y_i)\kappa^{\beta\,\mathrm{sign}(y_i-\mu)}A_i^{\beta-1}\frac{M_{i,1}}{M_{i,0}},\quad s_{i,\sigma} = -\frac{1}{\sigma}+\frac{\beta}{2\sigma}A_i^{\beta}\frac{M_{i,1}}{M_{i,0}},\quad s_{i,\kappa} = \frac{1-\kappa^2}{\kappa(1+\kappa^2)}-\frac{\beta}{2\kappa}\mathrm{sign}(y_i-\mu)A_i^{\beta}\frac{M_{i,1}}{M_{i,0}}.
Dα(f(yi;Θ)) depends on each particular case mentioned in section 2. The components of the score vector are given in Appendix Part 3.
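When the analytic derivatives D_{\alpha} are tedious, the outer-product estimate in Eq (5.1) can also be approximated with numerical scores. The sketch below uses central differences; `loglik_i` is a user-supplied per-observation log-density, and the step size h is an assumption of this illustration.

```python
def observed_info(loglik_i, theta, y, h=1e-5):
    """Approximate I_0 = sum_i s_i s_i^T using central-difference scores
    s_i = d loglik_i / d theta evaluated at theta."""
    p = len(theta)
    info = [[0.0] * p for _ in range(p)]
    for yi in y:
        score = []
        for k in range(p):
            tp, tm = list(theta), list(theta)
            tp[k] += h
            tm[k] -= h
            score.append((loglik_i(yi, tp) - loglik_i(yi, tm)) / (2.0 * h))
        for a in range(p):
            for b in range(p):
                info[a][b] += score[a] * score[b]
    return info
```

For a Normal(\mu, 1) log-density the score is y-\mu, so the routine should return \sum_i(y_i-\mu)^2, which provides a quick check of the implementation.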
In this section, we conduct Monte Carlo simulations to assess the performance of the proposed algorithm via bias and mean squared error (MSE) under different sample sizes and parameter values. These two measures are computed over 1500 replications and are defined as follows:
\mathrm{Bias}(\theta_i) = \frac{1}{1500}\sum_{j=1}^{1500}(\hat{\theta}_i^{(j)}-\theta_i),\qquad \mathrm{MSE}(\theta_i) = \frac{1}{1500}\sum_{j=1}^{1500}(\hat{\theta}_i^{(j)}-\theta_i)^{2},
where \hat{\theta}_i^{(j)} denotes the estimate of \theta_i in the j-th replication.
Without loss of generality, we consider sample sizes n=50,100,300,500,800,1000 and vary the value of the shape parameter \beta. More specifically, in order to study the influence of sampling variability on the estimation performance of the proposed method, we consider the cases \beta=1.5 and \beta=2.5, and for each case the remaining parameters (\mu,\sigma,\kappa,\alpha) are set to (10, 2.5, 1.5, 3.5).
Figures 2 and 3 exhibit the Bias and MSE plots for the parameter estimates of the AGN, AGT, AGSL, and CAGN distributions across varying sample sizes. Based on the observations drawn from these two pictures, the following findings can be elucidated:
(1) It is evident that both the Bias and MSE of the parameters for the different sub-models (AGN, AGT, AGSL, CAGN) diminish and converge to zero as the sample sizes increase. This convergence implies that the estimates obtained through the proposed EM-type algorithm adhere to the desirable asymptotic properties, signifying consistency.
(2) Notably, when employing the proposed EM-type algorithm, the skewness parameter \kappa and the scale parameter exhibit relatively small bias across all sample sizes. Additionally, the MSE of \kappa is quite sensitive to changes in sample size.
(3) Furthermore, Figures 2 and 3 reveal that the estimation performance of the EM algorithm for the AGT distribution closely matches that for the AGSL distribution.
(4) Conversely, the results indicate that the performance of the estimation method depends on the complexity of the distribution. Comparing the maximum likelihood estimates across the four members of the GSMAGN family, it is evident that the EM-type algorithm estimates the parameters of the AGN distribution effectively. However, for the CAGN distribution, which has a relatively larger number of parameters, the bias and MSE of \sigma and \beta are comparatively large, resulting in less stable estimates.
Through a comparative analysis of Figures 2 and 3, it becomes apparent that as the value of β increases, the Bias and MSE of the parameter estimates decrease for all four distributions overall. This observation indicates that the EM-type algorithm for maximum likelihood estimation is more suitable for distributions with lower leptokurtosis.
In order to facilitate the empirical investigation, datasets encompassing the S&P 500 Composite Index (S&P 500) and the Shanghai Stock Exchange Composite Index (SSEC), spanning from January 2, 1998 to June 18, 2023, were compiled. These datasets were acquired from https://finance.yahoo.com. A total of 6,408 samples were drawn from the S&P 500 and SSEC indices at corresponding time points to enable a comprehensive comparative analysis. Within the scope of this study, the return (R_t) in period t is defined as R_t=100(\ln P_t-\ln P_{t-1}), where P_t (for t=1,2,\ldots,N) represents the level of the S&P 500 or SSEC index at time t.
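The return transformation R_t = 100(\ln P_t - \ln P_{t-1}) is a one-liner; the code below is a generic sketch, not tied to the actual Yahoo Finance download.

```python
import math

def log_returns(prices):
    """Percentage log returns R_t = 100 * (ln P_t - ln P_{t-1})."""
    return [100.0 * (math.log(p1) - math.log(p0))
            for p0, p1 in zip(prices, prices[1:])]
```

Note that a series of N price levels yields N-1 returns, and a flat price gives a zero return.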
As elucidated by Wen et al. (2022) [25], notwithstanding divergent summary statistics between the S&P 500 and SSEC indices, both datasets deviate from the assumptions associated with a normal distribution. Furthermore, both sets of data exhibit characteristics such as heavy-tailedness, leptokurtosis, and left-skewness.
To assess the adequacy of the proposed GSMAGN family, fitting procedures were conducted for the AGN, AGT, AGSL, and CAGN distributions using the S&P 500 and SSEC datasets separately. The estimation of the distribution parameters was executed using the EM-type algorithm, as expounded in section four. To evaluate the goodness of fit of the four distributions, various metrics, including the observed log-likelihood (l), Akaike information criterion (AIC), Bayesian information criterion (BIC), and efficient determination criterion (EDC), were employed. The fitting results are consolidated in Tables 1 (for the S&P 500 dataset) and 2 (for the SSEC dataset). The final values of the shape parameter \beta of the GSMAGN family were determined by the trace of the PLA, as presented in Figures 4 and 5.
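The AIC and BIC entries in Tables 1 and 2 follow the usual definitions and can be reproduced from the reported log-likelihoods; for instance, the AGT row of Table 1 uses k = 5 parameters and n = 6408 observations. (The EDC penalty constant is not specified in this excerpt, so it is omitted from the sketch.)

```python
import math

def aic(loglik, k):
    """Akaike information criterion: -2 * loglik + 2 * k."""
    return -2.0 * loglik + 2.0 * k

def bic(loglik, k, n):
    """Bayesian information criterion: -2 * loglik + k * ln(n)."""
    return -2.0 * loglik + k * math.log(n)
```

For example, aic(-9602.951, 5) reproduces the AGT AIC in Table 1 up to rounding.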
Table 1. Fitting results for the S&P 500 dataset.

| Model | μ | σ | κ | α | β | γ | λ | ν | AIC | BIC | EDC | l |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AGN | 0.113 | 0.755 | 1.056 | –– | –– | 1.374 | –– | –– | 19695.64 | 19722.7 | 19751.65 | -9843.819 |
| AGT | 0.105 | 1.592 | 1.049 | 3.212 | 1.319 | –– | –– | –– | **19215.9** | **19249.72** | **19285.91** | **-9602.951** |
| AGSL | 0.0847 | 0.376 | 1.038 | 3.055 | 1.136 | –– | –– | –– | 19227.94 | 19261.77 | 19298.01 | -9608.971 |
| CAGN | 0.108 | 0.508 | 1.053 | 0.131 | 1.248 | 0.293 | –– | –– | 19237.83 | 19278.43 | 19321.92 | -9612.917 |
| SN | 0.859 | 2.248 | –– | –– | –– | –– | -0.999 | –– | 20908.13 | 20929.02 | 20950.77 | -10451.36 |
| ST | 0.258 | 0.577 | –– | –– | –– | –– | -0.298 | 2.79 | 19275.44 | 19302.5 | 19331.49 | -9633.719 |
| SCN | 0.295 | 0.59 | –– | –– | –– | 0.215 | -0.342 | 0.108 | 19432.05 | 19465.88 | 19502.12 | -9711.05 |
| BST | 0.139 | 0.737 | –– | 1.321 | 1.455 | –– | –– | –– | 19274.54 | 19301.6 | 19346.61 | -9633.269 |

Note: Bold font highlights the best model according to the model selection criteria.
Table 2. Fitting results for the SSEC dataset.

| Model | μ | σ | κ | α | β | γ | λ | ν | AIC | BIC | EDC | l |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AGN | 0.105 | 0.933 | 1.046 | –– | –– | 1.391 | –– | –– | 21538.4 | 21565.31 | 21593.23 | -10765.2 |
| AGT | 0.0814 | 1.973 | 1.028 | 3.053 | 1.368 | –– | –– | –– | 21171.13 | 21204.76 | 21239.76 | -10580.56 |
| AGSL | 0.0779 | 0.46 | 1.027 | 2.932 | 1.138 | –– | –– | –– | 21169.24 | 21202.87 | **21237.78** | -10579.62 |
| CAGN | 0.074 | 0.569 | 1.024 | 0.159 | 1.167 | 0.375 | –– | –– | **21161.66** | **21202.02** | 21243.9 | **-10574.83** |
| SN | 1.042 | 3.314 | –– | –– | –– | –– | -1.024 | –– | 22450.91 | 22471.09 | 22492.03 | -11222.46 |
| ST | 0.206 | 0.889 | –– | –– | –– | –– | -0.19 | 2.868 | 21215.86 | 21242.77 | 21270.69 | -10604.93 |
| SCN | 0.239 | 0.821 | –– | –– | –– | 0.254 | -0.227 | 0.122 | 21291.59 | 21328.22 | 21360.12 | -10642.8 |
| BST | 0.099 | 0.928 | –– | 1.39 | 1.466 | –– | –– | –– | 21232.15 | 21259.06 | 21318.45 | -10612.07 |

Note: Bold font highlights the best model according to the model selection criteria.
Upon scrutinizing Table 1, with a focus on the AIC and BIC criteria, it is discerned that the AGT distribution consistently outperforms the other distributions in fitting the S&P 500 dataset, with the AGSL distribution emerging as the next best-fitting alternative; this indicates the substantial advantage of these heavy-tailed members of the family when fitting data characterized by skewness and leptokurtic behavior. Turning attention to Table 2, considering the AIC, BIC, EDC, and log-likelihood values, it is deduced that the GSMAGN family surpasses the alternative models in terms of goodness of fit for the SSEC dataset.
To visually illustrate the fitting results, Figure 6 depicts histograms of the datasets alongside the corresponding fitted densities. Notably, the fitted curves of AGT, AGSL and CAGN closely align with the contours of the histograms, attesting to these models' ability to capture the pertinent information inherent in the observations.
This paper introduced a distribution family named GSMAGN, which holds significant relevance in the fitting of complex data characterized by skewness and heavy tails. The study focused on examining the properties of this family and deriving the explicit expression for the moment of order statistics. This expression serves as a valuable tool for parameter estimation based on the order statistics. To enhance the efficiency of parameter estimation for this model, an alternative variant of the EM-type algorithm was established. This variant combined the PLA with the classical ECM algorithm. The proposed method not only improves computational speed but also provides efficient estimations. Furthermore, closed-form expressions of the information matrix, along with the corresponding asymptotic properties of the MLE, were presented for this distribution family. To assess the performance of the proposed estimation method, Monte Carlo simulations were conducted under various scenarios. The results demonstrated that the proposed method satisfies the asymptotic property, and the estimation accuracy improves with larger sample sizes. The feasibility of the proposed methodology was illustrated through two real datasets. The encouraging results obtained in this study motivate us to further delve into the realm of multivariate GSMAGN family for effectively modeling datasets exhibiting asymmetry and heavy-tailedness in the future.
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
All authors declare no conflicts of interest in this paper.
Part 1: Proofs required for Section 2
∙ Proof of Proposition 2.2
According to Proposition 2.1, $Y = \mu+\sigma U^{-\frac{1}{\beta}}I(2Z)^{\frac{1}{\beta}}$, where $U$, $I$, $Z$ are independent of each other. Then, using the binomial expansion, we can obtain
$$\mathrm{E}(Y^{m}) = \sum_{i=0}^{m}\binom{m}{i}\sigma^{i}\mu^{m-i}2^{\frac{i}{\beta}}\mathrm{E}(I^{i})\mathrm{E}(Z^{\frac{i}{\beta}})\mathrm{E}(U^{-\frac{i}{\beta}}),$$
where $Z\sim Gamma(1/\beta,1)$, so that we can calculate
$$\mathrm{E}(Z^{\frac{i}{\beta}}) = \frac{1}{\Gamma(1/\beta)}\int_{0}^{\infty}z^{\frac{i+1}{\beta}-1}e^{-z}\,\mathrm{d}z = \frac{\Gamma((i+1)/\beta)}{\Gamma(1/\beta)},$$
and $I$ is a discrete random variable with density function $f_I = \frac{\kappa^{2}}{1+\kappa^{2}}\mathbf{1}_{\{I = -\kappa\}}+\frac{1}{1+\kappa^{2}}\mathbf{1}_{\{I = 1/\kappa\}}$. Thus the $i$th moment of $I$ is
$$\mathrm{E}(I^{i}) = (-\kappa)^{i}\frac{\kappa^{2}}{1+\kappa^{2}}+\frac{1}{\kappa^{i}}\frac{1}{1+\kappa^{2}} = \frac{1}{1+\kappa^{2}}\left[\frac{1}{\kappa^{i}}+(-1)^{i}\kappa^{2+i}\right].$$
By substituting $\mathrm{E}(I^{i})$ and $\mathrm{E}(Z^{\frac{i}{\beta}})$ into the above expression, we can prove Proposition 2.2. In particular, for $m = 1$ and $m = 2$, we obtain $\mathrm{E}(Y)$ and $\mathrm{E}(Y^{2})$, respectively, from which the expressions for $\mathrm{E}(Y)$ and $\mathrm{Var}(Y)$ follow.
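The resulting moment expression can be sanity-checked against the stochastic representation of Proposition 2.1 by Monte Carlo. The sketch below specializes to the AGN case ($U\equiv1$) with arbitrary illustrative parameter values:

```python
import math
import random

def agn_moment(m, mu, sigma, kappa, beta):
    """E(Y^m) for Y ~ AGN(mu, sigma, kappa, beta) via Proposition 2.2
    with U = 1 (no mixing)."""
    total = 0.0
    for i in range(m + 1):
        EI = (kappa**(-i) + (-1)**i * kappa**(2 + i)) / (1 + kappa**2)
        EZ = math.gamma((i + 1) / beta) / math.gamma(1 / beta)
        total += (math.comb(m, i) * sigma**i * mu**(m - i)
                  * 2**(i / beta) * EI * EZ)
    return total

def agn_sample(mu, sigma, kappa, beta, rng):
    """One draw via Y = mu + sigma * I * (2Z)^(1/beta), with
    I = -kappa w.p. kappa^2/(1+kappa^2) and I = 1/kappa otherwise."""
    z = rng.gammavariate(1 / beta, 1.0)
    i = -kappa if rng.random() < kappa**2 / (1 + kappa**2) else 1 / kappa
    return mu + sigma * i * (2 * z)**(1 / beta)

rng = random.Random(0)
mu, sigma, kappa, beta = 0.1, 0.8, 1.05, 1.4   # illustrative values
draws = [agn_sample(mu, sigma, kappa, beta, rng) for _ in range(200_000)]
mc = sum(d**2 for d in draws) / len(draws)          # Monte Carlo E(Y^2)
print(mc, agn_moment(2, mu, sigma, kappa, beta))    # should be close
```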
∙ Proof of Proposition 2.3
For the AGT distribution, invoking Eq (2.3), we have
\begin{align*} f_{AGT}(y;\mu,\sigma,\kappa,\alpha,\beta)& = \frac{1}{\Gamma(\alpha)\beta^{\alpha}}\int_{0}^{\infty}u^{\alpha-1}\exp\left(-\frac{u}{\beta}\right)f_{AGN}(y;\mu,u^{-1/\beta}\sigma,\kappa,\beta)\,\mathrm{d}u\\ & = \frac{\kappa\beta^{2-\alpha}}{2^{1/\beta}\Gamma(1/\beta)\Gamma(\alpha)(1+\kappa^{2})\sigma}\int_{0}^{+\infty}u^{\alpha\beta}\exp\left\{-\left[\frac{1}{2}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}+\frac{1}{\beta}\right]u^{\beta}\right\}\mathrm{d}u, \end{align*}
where the second equality follows from the change of variable $u\mapsto u^{\beta}$. Let $z = \left[\frac{1}{2}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}+\frac{1}{\beta}\right]u^{\beta}$, and we obtain
\begin{align*} \int_{0}^{+\infty}u^{\alpha\beta}\exp\left\{-\left[\frac{1}{2}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}+\frac{1}{\beta}\right]u^{\beta}\right\}\mathrm{d}u& = \left[\frac{1}{2}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}+\frac{1}{\beta}\right]^{-(\alpha+\frac{1}{\beta})}\frac{1}{\beta}\int_{0}^{+\infty}z^{\alpha+\frac{1}{\beta}-1}e^{-z}\,\mathrm{d}z\\ & = \beta^{\alpha+\frac{1}{\beta}-1}\Gamma\left(\alpha+\frac{1}{\beta}\right)\left[1+\frac{\beta}{2}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}\right]^{-(\alpha+\frac{1}{\beta})}. \end{align*}
Hence, we can derive the pdf of the AGT distribution.
As a result, by using $f_{U|Y}(u\mid y) = \frac{f_{AGN}(y;\mu,u^{-1/\beta}\sigma,\kappa,\beta)f(u)}{f_{AGT}(y)}$ and after some algebraic manipulation, we can obtain Proposition 2.3.
∙ Proof of Proposition 2.4
For the AGSL distribution, we have
\begin{align*} f_{AGSL}(y;\mu,\sigma,\kappa,\alpha,\beta)& = \alpha\int_{0}^{1}u^{\alpha-1}f_{AGN}(y;\mu,u^{-1/\beta}\sigma,\kappa,\beta)\,\mathrm{d}u\\ & = \frac{\kappa\alpha\beta}{2^{1/\beta}(1+\kappa^{2})\Gamma(1/\beta)\sigma}\int_{0}^{1}u^{\alpha+1/\beta-1}\exp\left[-\frac{u}{2}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}\right]\mathrm{d}u. \end{align*}
Let $z = \frac{u}{2}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}$; after some calculation we derive the pdf of the AGSL distribution.
By using $f_{U|Y}(u\mid y) = \frac{f_{AGN}(y;\mu,u^{-1/\beta}\sigma,\kappa,\beta)f(u)}{f_{AGSL}(y)}$ and after some algebraic manipulation, we can obtain Proposition 2.4.
∙ Proof of Proposition 2.5
If $\gamma = 1$, then CAGN reduces to AGN.
If $0 < \gamma < 1$, then the pdf of $U_j$ can be written as
$$H(u_j;\alpha) = \alpha^{\frac{1-u_j}{1-\gamma}}(1-\alpha)^{\frac{u_j-\gamma}{1-\gamma}},$$
and the conditional pdf of $U_j\mid Y_j = y_j$ can be written as
$$P_{U_j\mid Y_j}(u_j\mid y_j) = \frac{\left[\alpha f_{AGN}(y_j;\mu,\gamma^{-1/\beta}\sigma,\kappa,\beta)\right]^{\frac{1-u_j}{1-\gamma}}\left[(1-\alpha)f_{AGN}(y_j;\mu,\sigma,\kappa,\beta)\right]^{\frac{u_j-\gamma}{1-\gamma}}}{f_{CAGN}(y_j;\mu,\sigma,\kappa,\gamma,\alpha,\beta)}.$$
Part 2: Proofs required for Section 3
∙ Proof of Proposition 3.1
Here, we assume that $X_1,X_2,\cdots,X_n$ are identically distributed $AGN(0,1,\kappa,\beta)$ samples of size $n$ and $X_{1:n},X_{2:n},\cdots,X_{n:n}$ are the corresponding order statistics. In the case of $X < 0$, the cdf of $X$ can be evaluated using the power series expansion
$$F_{AGN}(x,\kappa,\beta) = \frac{\kappa^{2}}{(1+\kappa^{2})\Gamma(1/\beta)}\left[\Gamma\left(\frac{1}{\beta}\right)-\sum_{j=0}^{\infty}\frac{(-1)^{j}}{j!\left(\frac{1}{\beta}+j\right)}\left[\frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\frac{1}{\beta}+j}\right].$$
Let $b_j = \frac{(-1)^{j}}{j!(\frac{1}{\beta}+j)}$ for $j = 0,1,\cdots$; we have that $b_0 = \beta\neq0$. Consider the expansion of an integer power $s$ of a power series,
$$\left[\sum_{j=0}^{\infty}b_j y^{j}\right]^{s} = \sum_{j=0}^{\infty}b_j^{(s)}y^{j},\quad y > 0,$$
where
$$b_j^{(s)} = \sum_{h=0}^{j}b_h^{(s-1)}b_{j-h},\qquad b_j^{(1)} = b_j,\qquad b_j^{(0)} = \begin{cases}1,& j = 0,\\ 0,& j > 0.\end{cases}$$
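The convolution recursion for $b_j^{(s)}$ can be implemented directly; the sketch below truncates the series at a fixed number of terms and checks itself on $(1+y)^2$:

```python
def series_power_coeffs(b, s, terms):
    """Coefficients b_j^(s) of [sum_j b_j y^j]^s, truncated to `terms`
    coefficients, via the convolution b_j^(s) = sum_h b_h^(s-1) b_{j-h}."""
    cur = [1.0] + [0.0] * (terms - 1)   # b^(0): the constant series 1
    for _ in range(s):
        cur = [sum(cur[h] * b[j - h] for h in range(j + 1))
               for j in range(terms)]
    return cur

# sanity check against (1 + y)^2 = 1 + 2y + y^2
print(series_power_coeffs([1.0, 1.0, 0.0], 2, 3))  # -> [1.0, 2.0, 1.0]
```

In the proof above, $b$ would hold the coefficients $b_j = (-1)^j / (j!(1/\beta + j))$ and the truncation length controls the accuracy of the resulting moment series.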
After that, we can obtain
$$[F(x)]^{n-1} = \frac{\kappa^{2(n-1)}}{(1+\kappa^{2})^{n-1}[\Gamma(1/\beta)]^{n-1}}\sum_{i=0}^{n-1}(-1)^{i}\binom{n-1}{i}\left[\Gamma\left(\frac{1}{\beta}\right)\right]^{n-i-1}\left\{\sum_{j=0}^{\infty}b_j\left[\frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\frac{1}{\beta}+j}\right\}^{i},$$
such that
\begin{align*} I_1& = n\int_{-\infty}^{0}x^{m}[F(x)]^{n-1}f(x)\,\mathrm{d}x\\ & = \frac{n\kappa^{2n-1}\beta}{2^{1/\beta}(1+\kappa^{2})^{n}}\sum_{i=0}^{n-1}(-1)^{i}\binom{n-1}{i}\left[\Gamma\left(\frac{1}{\beta}\right)\right]^{-i-1}\sum_{j=0}^{\infty}b_j^{(i)}\int_{-\infty}^{0}x^{m}\left[\frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\frac{i}{\beta}+j}\exp\left\{-\frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right\}\mathrm{d}x. \end{align*}
Let $u = \frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}$ and we can easily get
$$I_1 = \frac{n(-1)^{m}\kappa^{2n+m}2^{\frac{m}{\beta}}}{(1+\kappa^{2})^{n}}\sum_{i=0}^{n-1}(-1)^{i}\binom{n-1}{i}\left[\Gamma\left(\frac{1}{\beta}\right)\right]^{-i-1}\sum_{j=0}^{\infty}b_j^{(i)}\Gamma\left(\frac{m+i+1}{\beta}+j\right).$$
Similarly,
$$I_2 = n\int_{0}^{\infty}x^{m}[F(x)]^{n-1}f(x)\,\mathrm{d}x = n\sum_{i=0}^{n-1}\binom{n-1}{i}\frac{\kappa^{2(n-i-1)-m}2^{\frac{m}{\beta}}}{(1+\kappa^{2})^{n}}\left[\Gamma\left(\frac{1}{\beta}\right)\right]^{-i-1}\sum_{j=0}^{\infty}b_j^{(i)}\Gamma\left(\frac{m+i+1}{\beta}+j\right).$$
Consequently,
$$\mathrm{E}(X_{n:n}^{m}) = n\sum_{i=0}^{n-1}\sum_{j=0}^{\infty}\binom{n-1}{i}\frac{2^{\frac{m}{\beta}}}{(1+\kappa^{2})^{n}[\Gamma(1/\beta)]^{i+1}}\left[(-1)^{i+m}\kappa^{2n+m}+\kappa^{2(n-i-1)-m}\right]b_j^{(i)}\Gamma\left(\frac{m+i+1}{\beta}+j\right).$$
∙ Proof of Proposition 3.2
If $X\sim AGN(0,1,\kappa,\beta)$, when $X\leq0$,
$$F_{AGN}(x,\kappa,\beta) = \frac{\kappa^{2}}{(1+\kappa^{2})\Gamma(1/\beta)}\left[\Gamma\left(\frac{1}{\beta}\right)-\int_{0}^{\frac{1}{2}(\frac{-x}{\kappa})^{\beta}}t^{\frac{1}{\beta}-1}e^{-t}\,\mathrm{d}t\right].$$
Since
$$\int_{0}^{\frac{1}{2}(\frac{-x}{\kappa})^{\beta}}t^{\frac{1}{\beta}-1}e^{-t}\,\mathrm{d}t = \sum_{m=0}^{\infty}\frac{(-1)^{m}}{m!\left(\frac{1}{\beta}+m\right)}\left[\frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\frac{1}{\beta}+m},$$
we obtain
\begin{align*} [F(x)]^{n-1}& = \frac{\kappa^{2(n-1)}}{(1+\kappa^{2})^{n-1}[\Gamma(\frac{1}{\beta})]^{n-1}}\sum_{i=0}^{n-1}(-1)^{i}\binom{n-1}{i}\left[\Gamma\left(\frac{1}{\beta}\right)\right]^{n-i-1}\left\{\int_{0}^{\frac{1}{2}(\frac{-x}{\kappa})^{\beta}}t^{\frac{1}{\beta}-1}e^{-t}\,\mathrm{d}t\right\}^{i}\\ & = \frac{\kappa^{2(n-1)}}{(1+\kappa^{2})^{n-1}[\Gamma(\frac{1}{\beta})]^{n-1}}\sum_{i=0}^{n-1}(-1)^{i}\binom{n-1}{i}\left[\Gamma\left(\frac{1}{\beta}\right)\right]^{n-i-1}\\ &\quad\times\sum_{m_1 = 0}^{\infty}\cdots\sum_{m_i = 0}^{\infty}\frac{(-1)^{m_1+\cdots+m_i}}{m_1!\cdots m_i!\left(\frac{1}{\beta}+m_1\right)\cdots\left(\frac{1}{\beta}+m_i\right)}\left[\frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\frac{i}{\beta}+\sum_{t = 1}^{i}m_t}. \end{align*}
Denote $H_1(m,n-1,\kappa) = \int_{-\infty}^{0}x^{m}[F(x)]^{n-1}f(x)\,\mathrm{d}x$; then
\begin{align*} H_1(m,n-1,\kappa)& = \frac{\kappa^{2n-1}\beta}{2^{1/\beta}(1+\kappa^{2})^{n}}\sum_{i=0}^{n-1}(-1)^{i}\binom{n-1}{i}\left[\Gamma\left(\frac{1}{\beta}\right)\right]^{-i-1}\sum_{m_1 = 0}^{\infty}\cdots\sum_{m_i = 0}^{\infty}\frac{(-1)^{m_1+\cdots+m_i}}{m_1!\cdots m_i!\left(\frac{1}{\beta}+m_1\right)\cdots\left(\frac{1}{\beta}+m_i\right)}\\ &\quad\times\int_{-\infty}^{0}x^{m}\left[\frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\frac{i}{\beta}+\sum_{t = 1}^{i}m_t}\exp\left\{-\frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right\}\mathrm{d}x, \end{align*}
and let $u = \frac{1}{2}\left(\frac{-x}{\kappa}\right)^{\beta}$; then $x = -\kappa(2u)^{\frac{1}{\beta}}$, $\frac{\mathrm{d}x}{\mathrm{d}u} = -\frac{\kappa}{\beta}2^{\frac{1}{\beta}}u^{\frac{1}{\beta}-1}$, and
$$H_1(m,n-1,\kappa) = \frac{(-1)^{m}\kappa^{2n+m}2^{\frac{m}{\beta}}}{(1+\kappa^{2})^{n}}\sum_{i=0}^{n-1}(-1)^{i}\binom{n-1}{i}\left[\Gamma\left(\frac{1}{\beta}\right)\right]^{-i-1}\sum_{m_1 = 0}^{\infty}\cdots\sum_{m_i = 0}^{\infty}\frac{(-1)^{m_1+\cdots+m_i}}{m_1!\cdots m_i!\left(\frac{1}{\beta}+m_1\right)\cdots\left(\frac{1}{\beta}+m_i\right)}\Gamma\left(\frac{m+i+1}{\beta}+\sum_{t = 1}^{i}m_t\right).$$
Using some algebraic operations, we can obtain
$$H_1(m,n-1,\kappa) = \frac{(-1)^{m}\kappa^{2n+m}2^{\frac{m}{\beta}}}{(1+\kappa^{2})^{n}}\sum_{i=0}^{n-1}(-\beta)^{i}\binom{n-1}{i}\left[\Gamma\left(\frac{1}{\beta}\right)\right]^{-i-1}\Gamma\left(\frac{m+i+1}{\beta}\right)\times F_A^{(i)}\left(\frac{m+i+1}{\beta},\frac{1}{\beta},\ldots,\frac{1}{\beta};\frac{1}{\beta}+1,\ldots,\frac{1}{\beta}+1;-1,\ldots,-1\right).$$
Denote $H_2(m,n-1,\kappa) = \int_{0}^{\infty}x^{m}[F(x)]^{n-1}f(x)\,\mathrm{d}x$. In the same way, we can obtain
$$H_2(m,n-1,\kappa) = \sum_{i=0}^{n-1}\binom{n-1}{i}\frac{\kappa^{2(n-i-1)-m}2^{\frac{m}{\beta}}}{(1+\kappa^{2})^{n}[\Gamma(1/\beta)]^{i+1}}\Gamma\left(\frac{m+i+1}{\beta}\right)\beta^{i}\times F_A^{(i)}\left(\frac{m+i+1}{\beta},\frac{1}{\beta},\ldots,\frac{1}{\beta};\frac{1}{\beta}+1,\ldots,\frac{1}{\beta}+1;-1,\ldots,-1\right).$$
∙ Proof of Proposition 3.3
If $\frac{1}{\beta}$ is an integer, when $x < 0$ we have
\begin{align*} F(x)& = \frac{\kappa^{2}}{(1+\kappa^{2})B(\alpha,1/\beta)}\int_{1-[1+\frac{\beta}{2}(\frac{-x}{\kappa})^{\beta}]^{-1}}^{1}t^{1/\beta-1}(1-t)^{\alpha-1}\,\mathrm{d}t\\ & = \frac{\kappa^{2}}{(1+\kappa^{2})B(\alpha,1/\beta)}\sum_{i=0}^{1/\beta-1}(-1)^{i}\binom{1/\beta-1}{i}\int_{1-[1+\frac{\beta}{2}(\frac{-x}{\kappa})^{\beta}]^{-1}}^{1}(1-t)^{\alpha+i-1}\,\mathrm{d}t, \end{align*}
and by the binomial expansion we obtain
$$F(x) = \frac{\kappa^{2}}{(1+\kappa^{2})B(\alpha,1/\beta)}\sum_{i=0}^{1/\beta-1}C_i\frac{1}{\left[1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\alpha+i}},$$
where $C_i = (-1)^{i}\binom{1/\beta-1}{i}\frac{1}{\alpha+i}$.
Let $T_v(1/\beta,n-1)$ be the coefficient of $\left[1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{-v}$ in the expansion of $\left\{\sum_{i=0}^{1/\beta-1}C_i\left[1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{-i}\right\}^{n-1}$. Clearly, we have $T_0(1/\beta,0) = 1$.
For $v = 0,1,\cdots,1/\beta-1$,
$$T_v(1/\beta,1) = C_v,$$
and for $n > 2$, $v = 0,1,\cdots,(1/\beta-1)(n-1)$,
$$T_v\left(\frac{1}{\beta},n-1\right) = \sum_{i=0}^{v}C_i T_{v-i}\left(\frac{1}{\beta},n-2\right).$$
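The recursion for $T_v(1/\beta,n-1)$ is a repeated convolution with the coefficients $C_i$ and can be computed directly; in the sketch below, the values $\alpha = 2$ and $1/\beta = 3$ are illustrative only:

```python
from math import comb

def C_coeffs(alpha, inv_beta):
    """C_i = (-1)^i * binom(1/beta - 1, i) / (alpha + i), i = 0..1/beta-1."""
    return [(-1)**i * comb(inv_beta - 1, i) / (alpha + i)
            for i in range(inv_beta)]

def T_coeffs(inv_beta, n_minus_1, C):
    """T_v(1/beta, n-1) for v = 0..(1/beta - 1)(n-1), via the recursion
    T_v(., k) = sum_i C_i T_{v-i}(., k-1), starting from T_0(., 0) = 1."""
    vmax = (inv_beta - 1) * n_minus_1
    T = [1.0] + [0.0] * vmax                  # T_v(., 0) = 1 only at v = 0
    for _ in range(n_minus_1):
        T = [sum(C[i] * T[v - i] for i in range(min(v, inv_beta - 1) + 1))
             for v in range(vmax + 1)]
    return T

C = C_coeffs(2.0, 3)          # illustrative: alpha = 2, 1/beta = 3
print(T_coeffs(3, 2, C))      # T_v(1/beta, 2) for v = 0..4
```

One convolution step recovers $T_v(1/\beta,1) = C_v$, which serves as a quick correctness check.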
We can calculate every coefficient directly, so $[F(x)]^{n-1}$ can be expressed as
$$[F(x)]^{n-1} = \frac{\kappa^{2(n-1)}}{(1+\kappa^{2})^{n-1}[B(\alpha,1/\beta)]^{n-1}}\sum_{v=0}^{(1/\beta-1)(n-1)}T_v(1/\beta,n-1)\frac{1}{\left[1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\alpha(n-1)+v}};$$
then, we have
$$nI_1(n-1,\kappa) = n\int_{-\infty}^{0}x^{m}[F(x)]^{n-1}f(x)\,\mathrm{d}x = \frac{n(-1)^{m}(2/\beta)^{\frac{m}{\beta}}\kappa^{2n+m}}{(1+\kappa^{2})^{n}[B(\alpha,1/\beta)]^{n}}\sum_{v=0}^{(1/\beta-1)(n-1)}T_v(1/\beta,n-1)\,B\left(\frac{m+1}{\beta},\alpha n+v-\frac{m}{\beta}\right),$$
and the above expression exists for $\alpha\beta > m/n$. It is not hard to verify the following statement:
$$nI_2(n-1,\kappa) = n\int_{0}^{\infty}x^{m}[F(x)]^{n-1}f(x)\,\mathrm{d}x = n\sum_{j=0}^{n-1}(-1)^{m+j}\binom{n-1}{j}I_1(j,1/\kappa).$$
Consequently,
$$\mathrm{E}(X_{n:n}^{m}) = \frac{n(2/\beta)^{\frac{m}{\beta}}}{(1+\kappa^{2})^{n}}\Bigg[\frac{(-1)^{m}\kappa^{2n+m}}{[B(\alpha,1/\beta)]^{n}}\sum_{v=0}^{(1/\beta-1)(n-1)}T_v(1/\beta,n-1)\,B\left(\frac{m+1}{\beta},\alpha n+v-\frac{m}{\beta}\right)+\sum_{j=0}^{n-1}(-1)^{j}\binom{n-1}{j}\frac{1}{\kappa^{m}[B(\alpha,1/\beta)]^{j+1}}\sum_{v=0}^{(1/\beta-1)j}T_v(1/\beta,j)\,B\left(\frac{m+1}{\beta},\alpha(j+1)+v-\frac{m}{\beta}\right)\Bigg].$$
∙ Proof of Proposition 3.4
Denote $J(m,n-1,\kappa) = \int_{-\infty}^{0}x^{m}[F(x)]^{n-1}f(x)\,\mathrm{d}x$. When $x < 0$ and $\frac{1}{\beta}$ is not an integer, we have
$$F(x) = \frac{\kappa^{2}}{(1+\kappa^{2})B(\alpha,1/\beta)}\sum_{i=0}^{\infty}(-1)^{i}\binom{1/\beta-1}{i}\frac{1}{\alpha+i}\frac{1}{\left[1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\alpha+i}},$$
where
$$\sum_{i=0}^{\infty}(-1)^{i}\binom{1/\beta-1}{i}\frac{1}{\alpha+i} = \sum_{i=0}^{\infty}\frac{\left(1-\frac{1}{\beta}\right)_i}{i!(\alpha+i)}.$$
Let $C = \frac{2\kappa^{2n-1}}{(2/\beta)^{1+\frac{1}{\beta}}(1+\kappa^{2})^{n}[B(\alpha,1/\beta)]^{n}}$, such that $J(m,n-1,\kappa)$ can be represented as
\begin{align*} J(m,n-1,\kappa)& = C\int_{-\infty}^{0}x^{m}\left\{\sum_{i=0}^{\infty}\frac{(1-\frac{1}{\beta})_i}{i!(\alpha+i)}\frac{1}{\left[1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\alpha+i}}\right\}^{n-1}\frac{1}{\left[1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\alpha+1/\beta}}\,\mathrm{d}x\\ & = C\sum_{m_1 = 0}^{\infty}\cdots\sum_{m_{n-1} = 0}^{\infty}\frac{\prod_{i = 1}^{n-1}(1-\frac{1}{\beta})_{m_i}}{\prod_{i = 1}^{n-1}(\alpha+m_i)m_i!}\int_{-\infty}^{0}x^{m}\frac{1}{\left[1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\alpha n+\sum_{i = 1}^{n-1}m_i+1/\beta}}\,\mathrm{d}x. \end{align*}
Let $u = \dfrac{\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}}{1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}}$; then, the above integral can be converted to
\begin{align*} \int_{-\infty}^{0}x^{m}\frac{1}{\left[1+\frac{\beta}{2}\left(\frac{-x}{\kappa}\right)^{\beta}\right]^{\alpha n+\sum_{i = 1}^{n-1}m_i+1/\beta}}\,\mathrm{d}x& = \frac{(-1)^{m}\kappa^{m+1}2^{\frac{m+1}{\beta}}}{\beta^{\frac{m+1}{\beta}+1}}\int_{0}^{1}u^{\frac{m+1}{\beta}-1}(1-u)^{\alpha n+\sum_{i = 1}^{n-1}m_i-\frac{m}{\beta}-1}\,\mathrm{d}u\\ & = \frac{(-1)^{m}\kappa^{m+1}2^{\frac{m+1}{\beta}}}{\beta^{\frac{m+1}{\beta}+1}}B\left(\frac{m+1}{\beta},\alpha n+\sum_{i = 1}^{n-1}m_i-\frac{m}{\beta}\right). \end{align*}
Substituting the above equation into $J(m,n-1,\kappa)$ and using some algebraic operations, we can obtain
$$J(m,n-1,\kappa) = \frac{(-1)^{m}\kappa^{2n+m}(2/\beta)^{\frac{m}{\beta}}B\left(\frac{m+1}{\beta},\alpha n-\frac{m}{\beta}\right)}{(1+\kappa^{2})^{n}[B(\alpha,1/\beta)]^{n}\alpha^{n-1}}\times F_{1:1}^{1:2}\left(\left(\alpha n-\frac{m}{\beta}\right):\left(1-\frac{1}{\beta},\alpha\right);\cdots;\left(1-\frac{1}{\beta},\alpha\right);\left(\alpha n+\frac{1}{\beta}\right):(\alpha+1);\cdots;(\alpha+1);1,\cdots,1\right).$$
It is worth noting that, when $\alpha\beta > m/n$, $B\left(\frac{m+1}{\beta},\alpha n+\sum_{i = 1}^{n-1}m_i-\frac{m}{\beta}\right) < \Gamma\left(\frac{m+1}{\beta}\right)$. On the other hand, with $a = \alpha$ and $b = \frac{1}{\beta}$, for a sufficiently large $N$ we can bound
\begin{align*} &\left|\underset{\max(m_1,\cdots,m_{n-1}) > N}{\sum\cdots\sum}\frac{(1-b)_{m_1}(1-b)_{m_2}\cdots(1-b)_{m_{n-1}}}{(a+m_1)(a+m_2)\cdots(a+m_{n-1})m_1!\cdots m_{n-1}!}B\left(\frac{m+1}{\beta},\alpha n+\sum_{i = 1}^{n-1}m_i-\frac{m}{\beta}\right)\right|\\ & < \Gamma\left(\frac{m+1}{\beta}\right)\underset{\max(m_1,\cdots,m_{n-1}) > N}{\sum\cdots\sum}\frac{|(1-b)_{m_1}(1-b)_{m_2}\cdots(1-b)_{m_{n-1}}|}{(a+m_1)(a+m_2)\cdots(a+m_{n-1})m_1!\cdots m_{n-1}!}\\ & = \Gamma\left(\frac{m+1}{\beta}\right)\left[\sum_{m_1 = 0}^{\infty}\cdots\sum_{m_{n-1} = 0}^{\infty}\frac{|(1-b)_{m_1}\cdots(1-b)_{m_{n-1}}|}{(a+m_1)\cdots(a+m_{n-1})m_1!\cdots m_{n-1}!}-\sum_{m_1 = 0}^{N}\cdots\sum_{m_{n-1} = 0}^{N}\frac{|(1-b)_{m_1}\cdots(1-b)_{m_{n-1}}|}{(a+m_1)\cdots(a+m_{n-1})m_1!\cdots m_{n-1}!}\right]\\ & = \Gamma\left(\frac{m+1}{\beta}\right)\left[\left\{\sum_{m = 0}^{\infty}\frac{|(1-b)_m|}{(a+m)m!}\right\}^{n-1}-\left\{\sum_{m = 0}^{N}\frac{|(1-b)_m|}{(a+m)m!}\right\}^{n-1}\right] < \infty. \end{align*}
This means that the $m$th moment of the maximum order statistic from $AGT(0,1,\kappa,\alpha,\beta)$ always exists.
∙ Proof of Proposition 3.5
Suppose that $Y\sim CAGN(0,1,\kappa,\lambda,\alpha,\beta)$, $X\sim AGN(0,1,\kappa,\beta)$, and $U$ is a discrete random variable with pdf defined in (2.5); then we have
\begin{align*} \mathrm{E}(Y_{r:n})& = P(U = 1)\mathrm{E}[Y_{r:n}\mid U = 1]+P(U = \lambda)\mathrm{E}[Y_{r:n}\mid U = \lambda]\\ & = P(U = 1)\mathrm{E}\left[\left[U^{-\frac{1}{\beta}}X\right]_{r:n}\mid U = 1\right]+P(U = \lambda)\mathrm{E}\left[\left[U^{-\frac{1}{\beta}}X\right]_{r:n}\mid U = \lambda\right]\\ & = P(U = 1)\mathrm{E}[X_{r:n}\mid U = 1]+P(U = \lambda)\mathrm{E}\left[\lambda^{-\frac{1}{\beta}}X_{r:n}\mid U = \lambda\right]. \end{align*}
In view of the independence between $U$ and $X$,
$$\mathrm{E}(Y_{r:n}) = P(U = 1)\mathrm{E}(X_{r:n})+P(U = \lambda)\lambda^{-\frac{1}{\beta}}\mathrm{E}(X_{r:n}) = \left[\alpha+\lambda^{-\frac{1}{\beta}}(1-\alpha)\right]\mathrm{E}(X_{r:n}).$$
Part 3: Proofs in Section 5 and details to obtain the information matrix
∙ Proof of Theorem 5.1
It is easy to prove that the distributions of the GSMAGN family satisfy items one and two of Lemma 5.1. As a result, our main task is to verify that the GSMAGN family satisfies item three under different circumstances.
Let $v = \frac{1}{2}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}$. According to Proposition 2.1, we can obtain that $v\overset{d}{=}U^{-1}Z$, which illustrates that $\mathrm{E}(v^{k}) = \mathrm{E}(U^{-k}Z^{k})$. Taking the first partial derivatives with respect to $\mu,\sigma,\beta,\kappa$:
\begin{align*} \frac{\partial v}{\partial\mu}& = \frac{\beta}{2\sigma}\mathrm{sign}(\mu-y)\kappa^{\mathrm{sign}(y-\mu)}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta-1},\\ \frac{\partial v}{\partial\sigma}& = -\frac{\beta}{2\sigma}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta},\\ \frac{\partial v}{\partial\beta}& = \frac{1}{2}\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}\ln\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right),\\ \frac{\partial v}{\partial\kappa}& = \frac{\beta}{2\kappa}\mathrm{sign}(y-\mu)\left(\kappa^{\mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\right)^{\beta}. \end{align*}
∙ For AGN
$$\ln f_{AGN} = \ln\kappa+\ln\beta-\frac{1}{\beta}\ln2-\ln(1+\kappa^{2})-\ln\Gamma(1/\beta)-\ln\sigma-v.$$
Take sufficiently small $\epsilon > 0$ such that $U_{\theta_0} = \{\theta:\|\theta_0-\theta\| < \epsilon\}\subset\tilde{\Theta}$; then, for all $\theta\in U_{\theta_0}$,
\begin{align*} \left|\frac{\partial\ln f_{AGN}}{\partial\mu}\right|& = \left|-\frac{\partial v}{\partial\mu}\right|\leq\frac{\beta}{2^{1/\beta}\sigma}\kappa^{\mathrm{sign}(y-\mu)}v^{1-1/\beta} = g_1(v),\\ \left|\frac{\partial\ln f_{AGN}}{\partial\sigma}\right|& = \left|-\frac{1}{\sigma}-\frac{\partial v}{\partial\sigma}\right|\leq\frac{1}{\sigma}+\frac{\beta}{\sigma}v\leq\frac{1}{\sigma_0-\epsilon}+\frac{\beta}{\sigma}v = G_1+g_2(v),\\ \left|\frac{\partial\ln f_{AGN}}{\partial\beta}\right|& = \left|\frac{1}{\beta}+\frac{1}{\beta^{2}}\ln2+\frac{\varphi(1/\beta)}{\beta^{2}}-\frac{\partial v}{\partial\beta}\right|\leq\frac{1}{\beta}+\frac{1}{\beta^{2}}\ln2+\frac{\varphi(1/\beta)}{\beta^{2}}+\left|\frac{\partial v}{\partial\beta}\right|\\ &\leq\frac{1}{\beta_0-\epsilon}+\frac{\ln2}{(\beta_0-\epsilon)^{2}}+\frac{\varphi(1/(\beta_0-\epsilon))}{(\beta_0-\epsilon)^{2}}+\frac{2v^{2}}{\beta} = G_2+g_3(v),\\ \left|\frac{\partial\ln f_{AGN}}{\partial\kappa}\right|& = \left|\frac{1}{\kappa}-\frac{2\kappa}{1+\kappa^{2}}-\frac{\partial v}{\partial\kappa}\right|\leq\frac{1}{\kappa}+\frac{2\kappa}{1+\kappa^{2}}+\frac{\beta}{\kappa}v\leq\frac{1+3(\kappa_0+\epsilon)^{2}}{(\kappa_0+\epsilon)[1+(\kappa_0+\epsilon)^{2}]}+\frac{\beta}{\kappa}v = G_3+g_4(v). \end{align*}
Let $G(y) = \max\{G_1,G_2,G_3\}+g_1(v)+g_2(v)+g_3(v)+g_4(v)$. For AGN, where $U = 1$, we get $\mathrm{E}(v^{k}) = \mathrm{E}(Z^{k})$, $Z\sim Gamma(1/\beta,1)$, for $\beta > 0$. Since $\mathrm{E}(v^{k}) < \infty$ for $k > -\frac{1}{\beta}$, we have $\mathrm{E}[G(y)] < \infty$.
∙ For AGT
\begin{align*} \ln f_{AGT}&\propto\ln\kappa-\left(1+\frac{1}{\beta}\right)\ln\frac{2}{\beta}-\ln(1+\kappa^{2})-\ln B(\alpha,1/\beta)-\ln\sigma-\left(\alpha+\frac{1}{\beta}\right)\ln(1+\beta v)\\ & = \ln\kappa-\left(1+\frac{1}{\beta}\right)\ln\frac{2}{\beta}-\ln(1+\kappa^{2})-\ln\Gamma(\alpha)-\ln\Gamma\left(\frac{1}{\beta}\right)+\ln\Gamma\left(\alpha+\frac{1}{\beta}\right)-\ln\sigma-\left(\alpha+\frac{1}{\beta}\right)\ln(1+\beta v). \end{align*}
Take sufficiently small $\epsilon > 0$ such that $U_{\theta_0} = \{\theta:\|\theta_0-\theta\| < \epsilon\}\subset\tilde{\Theta}$; then, for all $\theta\in U_{\theta_0}$,
\begin{align*} \left|\frac{\partial\ln f_{AGT}}{\partial\mu}\right|& = \left|-\left(\alpha+\frac{1}{\beta}\right)\frac{\beta}{1+\beta v}\frac{\partial v}{\partial\mu}\right|\leq\frac{\alpha\beta+1}{1+\beta v}\frac{\beta}{2^{1/\beta}\sigma}\kappa^{\mathrm{sign}(y-\mu)}v^{1-1/\beta}\leq\frac{\alpha\beta^{2}+\beta}{2^{1/\beta}\sigma}\kappa^{\mathrm{sign}(y-\mu)}v^{1-\frac{1}{\beta}} = g_1(v),\\ \left|\frac{\partial\ln f_{AGT}}{\partial\sigma}\right|& = \left|-\frac{1}{\sigma}-\left(\alpha+\frac{1}{\beta}\right)\frac{\beta}{1+\beta v}\frac{\partial v}{\partial\sigma}\right|\leq\frac{1}{\sigma}+\left(\alpha+\frac{1}{\beta}\right)\frac{\beta}{1+\beta v}\frac{\beta}{\sigma}v\leq\frac{1}{\sigma}+\frac{\alpha\beta+1}{\sigma}\leq\frac{1}{\sigma_0-\epsilon}+\frac{(\alpha_0+\epsilon)(\beta_0+\epsilon)+1}{\sigma_0-\epsilon} = G_1,\\ \left|\frac{\partial\ln f_{AGT}}{\partial\alpha}\right|& = \left|-\varphi(\alpha)+\varphi\left(\alpha+\frac{1}{\beta}\right)-\ln(1+\beta v)\right|\leq\varphi(\alpha)+\varphi\left(\alpha+\frac{1}{\beta}\right)+\ln(1+\beta v)\\ &\leq\varphi(\alpha_0+\epsilon)+\varphi\left(\alpha_0+\epsilon+\frac{1}{\beta_0-\epsilon}\right)+\beta v = G_2+g_2(v),\\ \left|\frac{\partial\ln f_{AGT}}{\partial\beta}\right|& = \bigg|\frac{1}{\beta^{2}}\ln\frac{2}{\beta}+\frac{1}{\beta}+\frac{1}{\beta^{2}}+\frac{1}{\beta^{2}}\varphi(1/\beta)-\frac{1}{\beta^{2}}\varphi\left(\alpha+\frac{1}{\beta}\right)+\frac{1}{\beta^{2}}\ln(1+\beta v)-\left(\alpha+\frac{1}{\beta}\right)\frac{1}{1+\beta v}\left(v+\beta\frac{\partial v}{\partial\beta}\right)\bigg|\\ &\leq\frac{1}{\beta^{2}}\ln\frac{2}{\beta}+\frac{1}{\beta}+\frac{1}{\beta^{2}}+\frac{1}{\beta^{2}}\varphi(1/\beta)+\frac{1}{\beta^{2}}\varphi\left(\alpha+\frac{1}{\beta}\right)+\frac{1}{\beta^{2}}\ln(1+\beta v)+\left(\alpha+\frac{1}{\beta}\right)\frac{1}{1+\beta v}\left(v+\beta\frac{2v^{2}}{\beta}\right)\\ &\leq\frac{1}{\beta^{2}}\left[\ln\frac{2}{\beta}+(\alpha+1)\beta+\varphi\left(\frac{1}{\beta}\right)+\varphi\left(\alpha+\frac{1}{\beta}\right)+2\right]+\left(2\alpha+\frac{2}{\beta}+1\right)\frac{v}{\beta}\\ &\leq\frac{1}{(\beta_0-\epsilon)^{2}}\left[\ln\frac{2}{\beta_0-\epsilon}+(\alpha_0+\epsilon+1)(\beta_0+\epsilon)+\varphi\left(\frac{1}{\beta_0-\epsilon}\right)+\varphi\left(\alpha_0+\epsilon+\frac{1}{\beta_0-\epsilon}\right)+2\right]+\left(2\alpha+\frac{2}{\beta}+1\right)\frac{v}{\beta} = G_3+g_3(v),\\ \left|\frac{\partial\ln f_{AGT}}{\partial\kappa}\right|& = \left|\frac{1}{\kappa}-\frac{2\kappa}{1+\kappa^{2}}-\left(\alpha+\frac{1}{\beta}\right)\frac{\beta}{1+\beta v}\frac{\partial v}{\partial\kappa}\right|\leq\frac{1}{\kappa}+\frac{2\kappa}{1+\kappa^{2}}+\frac{\alpha\beta+1}{1+\beta v}\frac{\beta}{\kappa}v\\ &\leq\frac{1+3(\kappa_0+\epsilon)^{2}}{(\kappa_0+\epsilon)[1+(\kappa_0+\epsilon)^{2}]}+\frac{(\alpha_0+\epsilon)(\beta_0+\epsilon)+1}{\kappa_0-\epsilon} = G_4. \end{align*}
Let $G(y) = \max\{G_1,G_2,G_3,G_4\}+g_1(v)+g_2(v)+g_3(v)$. For AGT, $\mathrm{E}(v^{k}) = \mathrm{E}(U^{-k})\mathrm{E}(Z^{k})$, where $U\sim Gamma(\alpha,\beta)$ and $Z\sim Gamma(1/\beta,1)$. Invoking Proposition 2, if $\alpha > 1$, then $\mathrm{E}(v) < \infty$, and if $\alpha+\frac{1}{\beta} > 1$, then $\mathrm{E}(v^{1-\frac{1}{\beta}}) < \infty$; hence, when $\alpha > 1$, we have $\mathrm{E}[G(y)] < \infty$.
∙ For AGSL
$$\ln f_{AGSL} = \ln\kappa+\ln\alpha+\ln\beta+\ln\Gamma(\alpha+1/\beta)-\frac{1}{\beta}\ln2-\ln\Gamma(1/\beta)-\ln\sigma+\ln\gamma\left(\alpha+\frac{1}{\beta},v\right)-\left(\alpha+\frac{1}{\beta}\right)\ln v,$$
where $\gamma\left(\alpha+\frac{1}{\beta};v\right) = \frac{1}{\Gamma(\alpha+1/\beta)}\int_{0}^{v}t^{\alpha+\frac{1}{\beta}-1}e^{-t}\,\mathrm{d}t$. By taking sufficiently small $\epsilon > 0$ such that $U_{\theta_0} = \{\theta:\|\theta_0-\theta\| < \epsilon\}\subset\tilde{\Theta}$, for all $\theta\in U_{\theta_0}$ we then have
\begin{align*} \left|\frac{\partial\ln f_{AGSL}}{\partial\mu}\right|& = \left|\frac{1}{\gamma(\alpha+\frac{1}{\beta};v)}\frac{1}{\Gamma(\alpha+\frac{1}{\beta})}v^{\alpha+\frac{1}{\beta}-1}e^{-v}\frac{\partial v}{\partial\mu}-\left(\alpha+\frac{1}{\beta}\right)\frac{1}{v}\frac{\partial v}{\partial\mu}\right|\\ &\leq\left|\frac{v^{\alpha+\frac{1}{\beta}-1}e^{-v}}{\int_{0}^{v}t^{\alpha+\frac{1}{\beta}-1}e^{-t}\,\mathrm{d}t}-\frac{\alpha+1/\beta}{v}\right|\left|\frac{\partial v}{\partial\mu}\right|\leq\left(\frac{\alpha+1/\beta}{v}-\frac{\alpha+1/\beta}{ve^{v}}\right)\left|\frac{\partial v}{\partial\mu}\right|\\ &\leq\frac{\alpha\beta+1}{2^{1/\beta}\sigma}\kappa^{\mathrm{sign}(y-\mu)}v^{1-\frac{1}{\beta}} = g_1(v),\\ \left|\frac{\partial\ln f_{AGSL}}{\partial\sigma}\right|& = \left|-\frac{1}{\sigma}+\frac{1}{\gamma(\alpha+\frac{1}{\beta};v)}\frac{1}{\Gamma(\alpha+\frac{1}{\beta})}v^{\alpha+\frac{1}{\beta}-1}e^{-v}\frac{\partial v}{\partial\sigma}-\left(\alpha+\frac{1}{\beta}\right)\frac{1}{v}\frac{\partial v}{\partial\sigma}\right|\\ &\leq\frac{1}{\sigma}+\frac{v^{\alpha+\frac{1}{\beta}-1}e^{-v}}{\int_{0}^{v}t^{\alpha+\frac{1}{\beta}-1}e^{-t}\,\mathrm{d}t}\frac{\beta}{\sigma}v+\left(\alpha+\frac{1}{\beta}\right)\frac{1}{v}\frac{\beta}{\sigma}v\leq\frac{2\alpha\beta+3}{\sigma}\leq\frac{2(\alpha_0+\epsilon)(\beta_0+\epsilon)+3}{\sigma_0-\epsilon} = G_1, \end{align*}
\begin{align*} \left|\frac{\partial \mathrm{ln}f_{AGSL}}{\partial\alpha}\right| = &\bigg|\frac{1}{\alpha}+\varphi(\alpha+1/\beta)+\frac{1}{\gamma(\alpha+\frac{1}{\beta};v)}\bigg(\frac{1}{\varGamma(\alpha+\frac{1}{\beta})}\int_0^{v}t^{\alpha+\frac{1}{\beta}-1} \mathrm{ln}te^{-t} \mathrm{d}t\\ &-\frac{\varphi(\alpha+\frac{1}{\beta})}{\varGamma^2(\alpha+\frac{1}{\beta})}\int_0^{v}t^{\alpha+\frac{1}{\beta}-1}e^{-t} \mathrm{d}t\bigg)-\ln v\bigg|\\ &\leq\frac{1}{\alpha}+\varphi(\alpha+1/\beta)+\frac{\int_0^{v}t^{\alpha+\frac{1}{\beta}-1}\ln te^{-t} \mathrm{d}t}{\int_0^{v}t^{\alpha+\frac{1}{\beta}-1}e^{-t} \mathrm{d}t}+\varphi(\alpha+1/\beta)+|\ln v|\\ &\leq\frac{1}{\alpha}+2\varphi(\alpha+1/\beta)+2|\ln v|\\ &\leq\frac{1}{\alpha_0-\epsilon}+2\varphi(\alpha_0+\epsilon+1/(\beta_0-\epsilon))+2v = G_2+g_2(v), \end{align*} |
\begin{align*} \left|\frac{\partial \mathrm{ln}f_{AGSL}}{\partial\kappa}\right|& = \left|\frac{1}{\kappa}-\frac{2\kappa}{1+\kappa^2}+\frac{1}{\gamma(\alpha+\frac{1}{\beta};v)}\frac{1}{\varGamma(\alpha+\frac{1}{\beta})}v^{\alpha+\frac{1}{\beta}-1}e^{-v}\frac{\partial v}{\partial \kappa}-(\alpha+\frac{1}{\beta})\frac{1}{v}\frac{\partial v}{\partial \kappa}\right|\\ &\leq\frac{1}{\kappa}+\frac{2\kappa}{1+\kappa^2}+\frac{v^{\alpha+\frac{1}{\beta}-1}e^{-v}}{\int_0^{v}t^{\alpha+\frac{1}{\beta}-1}e^{-t} \mathrm{d}t}\frac{\partial v}{\partial \kappa}+(\alpha+\frac{1}{\beta})\frac{1}{v}\frac{\partial v}{\partial \kappa}\\ &\leq\frac{1}{\kappa}+\frac{2\kappa}{1+\kappa^2}+\frac{\alpha+1/\beta}{v}\frac{\beta}{\kappa}v+(\alpha+\frac{1}{\beta})\frac{1}{v}\frac{\beta}{\kappa}v\\ &\leq\frac{1}{\kappa_0-\epsilon}+\frac{2(\kappa_0+\epsilon)}{1+(\kappa_0-\epsilon)^2}+\frac{2(\alpha_0+\epsilon)(\beta_0+\epsilon)+1}{\kappa_0-\epsilon} = G_3, \end{align*} |
\begin{align*} \left|\frac{\partial\ln f_{AGSL}}{\partial\beta}\right|& = \left|\frac{1}{\beta}-\frac{1}{\beta^2}\varphi(\alpha+\frac{1}{\beta})+\frac{1}{\beta^2}\ln 2+\frac{1}{\beta^2}\varphi(\frac{1}{\beta})+\frac{v^{\alpha+\frac{1}{\beta}-1}e^{-v}}{\int_0^{v}t^{\alpha+\frac{1}{\beta}-1}e^{-t} \mathrm{d}t}\frac{\partial v}{\partial \beta}-\frac{1}{\beta^2}\frac{\int_0^{v}t^{\alpha+\frac{1}{\beta}-1} \mathrm{ln}te^{-t} \mathrm{d}t}{\int_0^{v}t^{\alpha+\frac{1}{\beta}-1}e^{-t} \mathrm{d}t}\right|\\ &\leq\frac{1}{\beta}+\frac{1}{\beta^2}\varphi(\alpha+\frac{1}{\beta})+\frac{1}{\beta^2} \mathrm{ln}2+\frac{1}{\beta^2}\varphi(\frac{1}{\beta})+\frac{\alpha+\frac{1}{\beta}}{v}\frac{\partial v}{\partial \beta}+\frac{1}{\beta^2} \mathrm{ln}v\\ &\leq\frac{1}{\beta}+\frac{1}{\beta^2}\varphi(\alpha+\frac{1}{\beta})+\frac{\alpha\beta+2}{\beta^2} \mathrm{ln}2+\frac{\alpha\beta+2}{\beta^2} \mathrm{ln}v\\ &\leq\frac{1}{\beta_0-\epsilon}+\frac{1}{(\beta_0-\epsilon)^2}\varphi(\alpha_0+\epsilon+\frac{1}{\beta_0-\epsilon})+\frac{(\alpha_0+\epsilon)(\beta_0+\epsilon)+2}{(\beta_0-\epsilon)^2} \mathrm{ln}2+\frac{\alpha\beta+2}{\beta^2}v\\ & = G_4+g_3(v). \end{align*} |
Let $G(y) = \mathrm{max}\{G_1, G_2, G_3, G_4\}+g_1(v)+g_2(v)+g_3(v)$. For AGSL, $\mathrm{E}(v^k) = \mathrm{E}(U^{-k})\mathrm{E}(Z^k)$, where $U\sim Beta(\alpha, 1)$ and $Z\sim Gamma(1/\beta, 1)$. Invoking Proposition 2, we can obtain that if $\alpha > 1$, then $\mathrm{E}(v) < \infty$. For $\alpha+\frac{1}{\beta} > 1$, $\mathrm{E}(v^{1-\frac{1}{\beta}}) < \infty$, so that, when $\alpha > 1$, $\mathrm{E}[G(y)] < \infty$.
\bullet For CAGN
f_{CAGN}(y;\mu,\sigma,\kappa,\lambda,\alpha,\beta) = \alpha f_{AGN}(y;\mu,\lambda^{-1/\beta}\sigma,\kappa,\beta)+(1-\alpha)f_{AGN}(y;\mu,\sigma,\kappa,\beta). |
The CAGN density can be written as a linear combination of two AGN densities. From the proof above, the MLE for the AGN distribution exists and is strongly consistent; therefore, it can be concluded that CAGN satisfies the same property.
Details to obtain information matrix:
\bullet For AGT
M_{i,k} = \frac{1}{\beta^{\alpha}\Gamma(\alpha)}\frac{\Gamma(\alpha+\frac{1}{\beta}+k)}{\left[\frac{1}{2}{A_i}^{\beta}+\frac{1}{\beta}\right]^{\alpha+\frac{1}{\beta}+k}}, |
and,
s_{i,\alpha} = -\ln\beta-\varphi(\alpha)+\frac{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}-1}\ln u\exp\bigg\{-\bigg(\frac{1}{2}{A_i}^\beta+\frac{1}{\beta} \bigg) u\bigg\} \mathrm{d}u}{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}-1}\exp\bigg\{-\bigg(\frac{1}{2}{A_i}^\beta+\frac{1}{\beta} \bigg) u\bigg\} \mathrm{d}u} |
\begin{align*} s_{i,\beta} = &\frac{1-\alpha}{\beta}+\frac{\varphi(1/\beta)+\ln 2}{\beta^2}-\frac{1}{\beta^2}\frac{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}-1}\ln u\exp\bigg\{-\bigg(\frac{1}{2}{A_i}^\beta+\frac{1}{\beta} \bigg) u\bigg\} \mathrm{d}u}{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}-1}\exp\bigg\{-\bigg(\frac{1}{2}{A_i}^\beta+\frac{1}{\beta} \bigg) u\bigg\} \mathrm{d}u}\\ &+\left[\frac{1}{\beta^2}-\frac{1}{2}{A_i}^\beta\ln\bigg(\kappa^{ \mathrm{sign}(y_i-\mu)}\frac{|y_i-\mu|}{\sigma}\bigg)\right] \times\frac{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}}\exp\bigg\{-\bigg(\frac{1}{2}{A_i}^\beta+\frac{1}{\beta} \bigg) u\bigg\} \mathrm{d}u}{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}-1}\exp\bigg\{-\bigg(\frac{1}{2}{A_i}^\beta+\frac{1}{\beta} \bigg) u\bigg\} \mathrm{d}u}. \end{align*} |
\bullet For AGSL
M_{i,k} = \alpha2^{\alpha+\frac{1}{\beta}+k}{A_i}^{-[(\alpha+k)\beta+1]}\gamma\bigg(\alpha+\frac{1}{\beta}+k,\frac{1}{2}{A_i}^{\beta}\bigg), |
and,
s_{i,\alpha} = \frac{1}{\alpha}+\frac{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}-1}\ln u\exp\bigg\{-\frac{1}{2}{A_i}^\beta u\bigg\} \mathrm{d}u}{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}-1}\exp\bigg\{-\frac{1}{2}{A_i}^\beta u\bigg\} \mathrm{d}u}. |
\begin{align*} s_{i,\beta} = \frac{1}{\beta}+\frac{\varphi(1/\beta)+\ln 2}{\beta^2}-\left[\frac{1}{\beta^2}+\frac{1}{2}{A_i}^\beta\ln\bigg(\kappa^{ \mathrm{sign}(y-\mu)}\frac{|y-\mu|}{\sigma}\bigg)\right] \times\frac{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}-1}\ln u\exp\bigg\{-\frac{1}{2}{A_i}^\beta u\bigg\} \mathrm{d}u}{\int_{0}^{1}u^{\alpha+\frac{1}{\beta}-1}\exp\bigg\{-\frac{1}{2}{A_i}^\beta u\bigg\} \mathrm{d}u}. \end{align*} |
\bullet For CAGN
M_{i,k} = \lambda^{\frac{1}{\beta}+k}\exp\bigg\{-\frac{\lambda}{2}{A_i}^\beta\bigg\}+(1-\alpha)\exp\bigg\{-\frac{1}{2}{A_i}^\beta\bigg\}, |
and,
s_{i,\alpha} = \frac{f_{AGN}(y_i;\mu,\lambda^{-\frac{1}{\beta}}\sigma,\kappa,\beta)-f_{AGN}(y_i;\mu,\sigma,\kappa,\beta)}{\alpha f_{AGN}(y_i;\mu,\lambda^{-\frac{1}{\beta}}\sigma,\kappa,\beta)+(1-\alpha)f_{AGN}(y_i;\mu,\sigma,\kappa,\beta)}. |
s_{i,\lambda} = \frac{\alpha\left[\frac{1}{\beta\lambda}-\frac{1}{2}A_i^\beta\right]f_{AGN}(y_i;\mu,\lambda^{-\frac{1}{\beta}}\sigma,\kappa,\beta)}{\alpha f_{AGN}(y_i;\mu,\lambda^{-\frac{1}{\beta}}\sigma,\kappa,\beta)+(1-\alpha)f_{AGN}(y_i;\mu,\sigma,\kappa,\beta)}. |
The information-based approximation (11) is asymptotically valid. With a sufficiently large sample size, the standard errors of $\hat{\Theta}$ can be obtained from the inverse of the empirical information matrix $\mathbf{I_0(\Theta|y)}$, which guarantees the accuracy of the estimation while reducing the computational burden.
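As a generic illustration of this information-based approach (a sketch only, not the paper's closed-form matrix), one can approximate the observed information by a finite-difference Hessian of the negative log-likelihood and read standard errors off its inverse; a Gaussian likelihood serves here as a stand-in for a GSMAGN member:

```python
import numpy as np

def numerical_hessian(f, x, h=1e-5):
    """Central finite-difference Hessian of a scalar function f at x."""
    x = np.asarray(x, dtype=float)
    d = len(x)
    H = np.zeros((d, d))
    for i in range(d):
        for j in range(d):
            e_i, e_j = np.eye(d)[i] * h, np.eye(d)[j] * h
            H[i, j] = (f(x + e_i + e_j) - f(x + e_i - e_j)
                       - f(x - e_i + e_j) + f(x - e_i - e_j)) / (4 * h * h)
    return H

def std_errors(negloglik, theta_hat):
    """SEs from the inverse of the observed information (Hessian of -l)."""
    info = numerical_hessian(negloglik, theta_hat)
    return np.sqrt(np.diag(np.linalg.inv(info)))

# stand-in likelihood: Gaussian, parameterized by (mu, log sigma)
rng = np.random.default_rng(1)
y = rng.normal(0.5, 2.0, size=4000)

def nll(theta):
    mu, log_sigma = theta
    s = np.exp(log_sigma)
    return 0.5 * np.sum((y - mu) ** 2) / s**2 + len(y) * log_sigma

se = std_errors(nll, np.array([y.mean(), np.log(y.std())]))
print(se)
```

For this Gaussian stand-in, `se[0]` recovers the textbook $\hat{\sigma}/\sqrt{n}$ standard error of the mean, which confirms the approximation.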
[1] D. F. Andrews, C. L. Mallows, Scale mixtures of normal distributions, J. R. Stat. Soc. Ser. B Methodol., 36 (1974), 99–102. https://doi.org/10.1111/j.2517-6161.1974.tb00989.x
[2] M. West, On scale mixtures of normal distributions, Biometrika, 74 (1987), 646–648. https://doi.org/10.1093/biomet/74.3.646
[3] M. D. Branco, D. K. Dey, A general class of multivariate skew-elliptical distributions, J. Multivar. Anal., 79 (2001), 99–113. https://doi.org/10.1006/jmva.2000.1960
[4] C. S. Ferreira, H. Bolfarine, V. H. Lachos, Linear mixed models based on skew scale mixtures of normal distribution, Commun. Stat. Simul. Comput., 51 (2020), 7194–7214. https://doi.org/10.1080/03610918.2020.1827265
[5] R. M. Basso, V. H. Lachos, C. R. B. Cabral, P. Ghosh, Robust mixture modeling based on scale mixtures of skew-normal distributions, Comput. Stat. Data Anal., 54 (2010), 2926–2941. https://doi.org/10.1016/j.csda.2009.09.031
[6] H. M. Kim, M. G. Genton, Characteristic functions of scale mixtures of multivariate skew-normal distributions, J. Multivar. Anal., 102 (2011), 1105–1117. https://doi.org/10.1016/j.jmva.2011.03.004
[7] T. I. Lin, J. C. Lee, W. J. Hsieh, Robust mixture modeling using the skew t distribution, Stat. Comput., 17 (2007), 81–92. https://doi.org/10.1007/s11222-006-9005-8
[8] T. I. Lin, H. J. Ho, C. R. Lee, Flexible mixture modelling using the multivariate skew-t-normal distribution, Stat. Comput., 24 (2014), 531–546. https://doi.org/10.1007/s11222-013-9386-4
[9] T. I. Lin, J. C. Lee, Y. Y. Shu, Finite mixture modelling using the skew normal distribution, Stat. Sin., 17 (2007), 909–927.
[10] A. Mahdavi, V. Amirzadeh, A. Jamalizadeh, T. I. Lin, Maximum likelihood estimation for scale-shape mixtures of flexible generalized skew normal distributions via selection representation, Comput. Stat., 36 (2021), 2201–2230. https://doi.org/10.1007/s00180-021-01079-2
[11] A. Azzalini, A class of distributions which includes the normal ones, Scand. J. Stat., 12 (1985), 171–178.
[12] A. Azzalini, A. Dalla Valle, The multivariate skew-normal distribution, Biometrika, 83 (1996), 715–726. https://doi.org/10.1093/biomet/83.4.715
[13] A. Azzalini, A. Capitanio, Statistical applications of the multivariate skew normal distribution, J. R. Stat. Soc. Ser. B Methodol., 61 (1999), 579–602. https://doi.org/10.1111/1467-9868.00194
[14] R. B. Arellano-Valle, A. Azzalini, On the unification of families of skew-normal distributions, Scand. J. Stat., 33 (2006), 561–574. https://doi.org/10.1111/j.1467-9469.2006.00503.x
[15] R. B. Arellano-Valle, M. G. Genton, On fundamental skew distributions, J. Multivar. Anal., 96 (2005), 93–116. https://doi.org/10.1016/j.jmva.2004.10.002
[16] A. Azzalini, The skew-normal distribution and related multivariate families, Scand. J. Stat., 32 (2005), 159–188. https://doi.org/10.1111/j.1467-9469.2005.00426.x
[17] C. Fernandez, M. F. J. Steel, Reference priors for non-normal two-sample problems, Test, 7 (1998), 179–205. https://doi.org/10.1007/BF02565109
[18] D. M. Zhu, V. Zinde-Walsh, Properties and estimation of asymmetric exponential power distribution, J. Econom., 148 (2009), 86–99. https://doi.org/10.1016/j.jeconom.2008.09.038
[19] R. J. Guan, X. Zhao, W. H. Cheng, Y. H. Rong, A new generalized t distribution based on a distribution construction method, Mathematics, 9 (2021), 2413. https://doi.org/10.3390/math9192413
[20] H. Exton, Handbook of hypergeometric integrals: Theory, applications, tables, computer programs, New York: Halsted Press, 1978.
[21] A. P. Dempster, N. M. Laird, D. B. Rubin, Maximum likelihood from incomplete data via the EM algorithm, J. R. Stat. Soc. Ser. B Methodol., 39 (1977), 1–22. https://doi.org/10.1111/j.2517-6161.1977.tb01600.x
[22] K. Lange, The EM algorithm, New York: Springer, 2013.
[23] X. L. Meng, D. B. Rubin, Maximum likelihood estimation via the ECM algorithm: A general framework, Biometrika, 80 (1993), 267–278. https://doi.org/10.2307/2337198
[24] P. Huber, Robust statistics, New York: Wiley, 1981.
[25] L. L. Wen, Y. J. Qiu, M. H. Wang, J. L. Yin, P. Y. Chen, Numerical characteristics and parameter estimation of finite mixed generalized normal distribution, Commun. Stat. Simul. Comput., 51 (2022), 3596–3620. https://doi.org/10.1080/03610918.2020.1720733