Breast cancer diagnosis using feature extraction and boosted C5.0 decision tree algorithm with penalty factor

Jian-xue Tian; Jue Zhang; Jian-xue Tian; Jue Zhang

doi:10.3934/mbe.2022102

Mathematical Biosciences and Engineering

2022, Volume 19, Issue 3: 2193-2205. doi: 10.3934/mbe.2022102

Previous Article Next Article

Research article

Breast cancer diagnosis using feature extraction and boosted C5.0 decision tree algorithm with penalty factor

Jian-xue Tian ,
Jue Zhang ^,

School of Information Engineer, Yulin University, Road chongwen, Yulin 719000, China

Academic Editor: Xiao Wang

Received: 16 September 2021 Revised: 28 November 2021 Accepted: 14 December 2021 Published: 04 January 2022

To overcome the two class imbalance problem among breast cancer diagnosis, a hybrid method by combining principal component analysis (PCA) and boosted C5.0 decision tree algorithm with penalty factor is proposed to address this issue. PCA is used to reduce the dimension of feature subset. The boosted C5.0 decision tree algorithm is utilized as an ensemble classifier for classification. Penalty factor is used to optimize the classification result. To demonstrate the efficiency of the proposed method, it is implemented on biased-representative breast cancer datasets from the University of California Irvine(UCI) machine learning repository. Given the experimental results and further analysis, our proposal is a promising method for breast cancer and can be used as an alternative method in class imbalance learning. Indeed, we observe that the feature extraction process has helped us improve diagnostic accuracy. We also demonstrate that the extracted features considering breast cancer issues are essential to high diagnostic accuracy.

Keywords:

Citation: Jian-xue Tian, Jue Zhang. Breast cancer diagnosis using feature extraction and boosted C5.0 decision tree algorithm with penalty factor[J]. Mathematical Biosciences and Engineering, 2022, 19(3): 2193-2205. doi: 10.3934/mbe.2022102

Related Papers:

[1]	Baishuai Zuo, Chuancun Yin . Stein’s lemma for truncated generalized skew-elliptical random vectors. AIMS Mathematics, 2020, 5(4): 3423-3433. doi: 10.3934/math.2020221
[2]	Guangshuai Zhou, Chuancun Yin . Family of extended mean mixtures of multivariate normal distributions: Properties, inference and applications. AIMS Mathematics, 2022, 7(7): 12390-12414. doi: 10.3934/math.2022688
[3]	Remigijus Leipus, Jonas Šiaulys, Dimitrios Konstantinides . Minimum of heavy-tailed random variables is not heavy tailed. AIMS Mathematics, 2023, 8(6): 13066-13072. doi: 10.3934/math.2023658
[4]	Naif Alotaibi, A. S. Al-Moisheer, Ibrahim Elbatal, Salem A. Alyami, Ahmed M. Gemeay, Ehab M. Almetwally . Bivariate step-stress accelerated life test for a new three-parameter model under progressive censored schemes with application in medical. AIMS Mathematics, 2024, 9(2): 3521-3558. doi: 10.3934/math.2024173
[5]	Huifang Yuan, Tao Jiang, Min Xiao . The ruin probability of a discrete risk model with unilateral linear dependent claims. AIMS Mathematics, 2024, 9(4): 9785-9807. doi: 10.3934/math.2024479
[6]	Weiwei Ni, Chenghao Xu, Kaiyong Wang . Estimations for aggregate amount of claims in a risk model with arbitrary dependence between claim sizes and inter-arrival times. AIMS Mathematics, 2022, 7(10): 17737-17746. doi: 10.3934/math.2022976
[7]	Khaled M. Alqahtani, Mahmoud El-Morshedy, Hend S. Shahen, Mohamed S. Eliwa . A discrete extension of the Burr-Hatke distribution: Generalized hypergeometric functions, different inference techniques, simulation ranking with modeling and analysis of sustainable count data. AIMS Mathematics, 2024, 9(4): 9394-9418. doi: 10.3934/math.2024458
[8]	Gunduz Caginalp . Fat tails arise endogenously from supply/demand, with or without jump processes. AIMS Mathematics, 2021, 6(5): 4811-4846. doi: 10.3934/math.2021283
[9]	Hamid Reza Safaeyan, Karim Zare, Mohamadreza Mahmoudi, Mohsen Maleki, Amir Mosavi . A Bayesian approach on asymmetric heavy tailed mixture of factor analyzer. AIMS Mathematics, 2024, 9(6): 15837-15856. doi: 10.3934/math.2024765
[10]	Yanfang Zhang, Fuchang Wang, Yibin Zhao . Statistical characteristics of earthquake magnitude based on the composite model. AIMS Mathematics, 2024, 9(1): 607-624. doi: 10.3934/math.2024032

Abstract

1. Introduction and motivation

One of the main challenges faced by financial companies is to evaluate market risks in a set of changes of the basic variables such as stock prices, interest rates or exchange rates. In this regard the Value-at-Risk ( $\text{VaR}$ ) introduced by J. P. Morgan in the mid 1990s has become a standard risk measure of financial market risk. Despite its extensive use, the $\text{VaR}$ is not a coherent risk measure because it fails to satisfy subadditivity property (see ^[1]). The $\text{VaR}$ can not determine the expected loss of portfolio in $q$ worst case, but it defines the minimum loss. Furthermore, the computation of the $\text{VaR}$ is based on the assumption that financial data returns follow the normal distribution. However, as shown in the literatures, the underlying distributions of many financial data exhibit skewness, non-symmetric, heavy tails and excess kurtosis (see ^[10]). They suggest in particular that large losses occur with much higher probability than the normal distribution would suggest.

The tail conditional expectation (TCE) risk measure shares properties that are considered desirable in all cases. For instance, due to the additivity of expectations, $\text{TCE}$ allows venture capital to decompose naturally among its various components.

Consider $X$ to be a loss random variable whose cumulative distribution function (cdf) is denoted by $F_X(x)$ . The $\text{TCE}$ is defined as

$\begin{align*} \text{TCE}_p(X) = \text{E}(X|X > x_p), \; p\in(0, 1), \end{align*}$

where $x_p =$ inf ${\{x\in \Bbb{R}\; :\; F_X(x)\geq p\}} = \text{VaR}_p(X)$ . The $\text{TCE}$ has been discussed in many literatures (see e.g., ^{[11,12,15,17]}).

The tail conditional expectation risk measure shares properties that are considered desirable in a variety of situations. For instance, due to the additivity of expectations, $\text{TCE}$ allows for a natural decomposition of risk capital among its various constituents. The conception of capital allocation principle has long been introduced, in which the capital allocated to each risk unit can be expressed as its contribution to the tail conditional expectation of total risk. Risk allocation can not only help to evaluate and compare the performance of individual risk units, but also help to understand the risk contribution of each unit towards the total risk of the portfolio. Landsman and Valdez ^[17] derived the portfolio risk decomposition with $\text{TCE}$ for the multivariate elliptical distribution. In ^[18], authors derived the portfolio risk decomposition with $\text{TCE}$ for the exponential dispersion model, and Kim ^[14] for the exponential family class. The allocation for the class of exponential marginal was developed in ^[]. The portfolio risk decomposition with $\text{TCE}$ was further considered in ^[19] for the skew-normal distribution. Furman and Landsman ^[16] for the multivariate Gamma distribution. Cai and Li ^[2] for the phase-type distribution. Goovaerts et al. ^[9] and Chiragiev and Landsman ^[4] have provided the $\text{TCE}$ -capital allocation for the multivariate Pareto distribution while Cossette et al. ^[5] have considered multivariate compound distribution. Ignatieva and Landsman ^[12] for generalized hyperbolic distribution. Recently, Kim and Kim ^[15] and Ignatieva and Landsman ^[12] investigated the $\text{TCE}$ allocation for the family of multivariate normal mean-variance mixture distributions and skewed generalized hyperbolic, respectively. The univariate $\text{TCE}$ and risk allocation formula for the generalized hyper-elliptical class were available in ^[13].

Furman and Landsman ^[8] observed that in many cases the $\text{TCE}$ does not provide adequate information about the risks on the right tail. This point can be confirmed by the fact that the $\text{TCE}$ does not include the information that the risk deviates from the upper tail expectation. Furman and Landsman ^[8] introduced the tail variance measure. The tail variance is defined as

$\begin{align*} \text{TV}_p(X) = \text{Var}(X|X > x_p) = \text{E}\left((X- \text{TCE}_p(X))^2|X > x_p\right), \end{align*}$

and it has been discussed in many literatures (see e.g., ^[8,15]).

In this paper we consider a class of multivariate location-scale mixtures of elliptical ( $\text{LSME}$ ) distributions which is known to be extremely flexible and contains many special cases as its members. Examples include the generalized hyper-elliptical distribution and generalized hyperbolic distribution.

The rest of the paper is organized as follows. Section 2 reviews the definition and properties of the multivariate $\text{LSME}$ class, and introduces the generalized hyper-elliptical distribution as a representative subclass. Section 3 presents a theorem and proves the proposed $\text{TCE}$ formula for the $\text{LSME}$ and in Section 4, the development is extended to the portfolio risk decomposition with $\text{TCE}$ for the multivariate $\text{LSME}$ . In Section 5, we develop $\text{TV}$ formula for univariate $\text{LSME}$ . Section 6 deals with the special case of generalized hyperbolic distribution. Numercial illustration is presented in Section 7. Finally, concluding remarks are presented in Section 8.

2. Mixture of elliptical distributions

In this section, we introduce the class of location-scale mixtures of elliptical distributions and some of its properties.

Let $\boldsymbol{\Psi}_n$ be a class of functions $\psi(t): [0, \infty)\rightarrow \Bbb{R}$ such that function $\psi(\sum_{i = 1}^{n}t_{i}^{2})$ is an $n$ -dimensional characteristic function.

A random vector $\boldsymbol{Y}$ is said to have a multivariate elliptical distribution, denoted by $\boldsymbol{Y}\sim \text{E}_n(\boldsymbol{\mu}, \boldsymbol{\Sigma}, \psi)$ , if its characteristic function can be expressed as

$\begin{align} \varphi_{\boldsymbol{Y}}(\boldsymbol{t}) = \exp(i\boldsymbol{t}^{T}\boldsymbol{\mu})\psi(\frac{1}{2}\boldsymbol{t}^{T}\boldsymbol{\Sigma}\boldsymbol{t}), \end{align}$

(2.1)

for column-vector vector $\boldsymbol{\mu}$ , $n\times n$ positive definite scale matrix $\boldsymbol{\Sigma}$ , and for function $\psi(t)\in \boldsymbol{\Psi}_n$ , which is called the characteristic generator.

In general, a multivariate elliptical distribution may not have a probability density function (pdf), but if its pdf exists then the form will be

$\begin{align} f_{\boldsymbol{Y}}(\boldsymbol{y}) = \frac{c_n}{\sqrt{|\boldsymbol{\boldsymbol{\Sigma}}|}}g_n[\frac{1}{2}(\boldsymbol{y}-\boldsymbol{\mu})^{T}\boldsymbol{\Sigma}^{-1}(\boldsymbol{y}-\boldsymbol{\mu})], \end{align}$

(2.2)

for function $g_n(\cdot)$ , which is called the density generator. The condition

$\begin{align} \int_{0}^{\infty} u^{\frac{n}{2}-1}g_n(u)du < \infty \end{align}$

(2.3)

guarantees $g_n(u)$ to be the density generator (^[7]). In addition, the normalizing constant $c_n$ is

$\begin{align*} c_n = \frac{\Gamma(\frac{n}{2})}{(2\pi)^{\frac{n}{2}}}\left(\int_{0}^{\infty}u^{\frac{n}{2}-1}g_n(u)du\right)^{-1}. \end{align*}$

Similarly, the elliptical distribution can also be introduced by the density generator and then written $\boldsymbol{Y}\sim \text{E}_n(\boldsymbol{\mu}, \boldsymbol{\Sigma}, g_n)$ .

From (2.1), it follows that, if $\boldsymbol{Y}\sim \text{E}_n(\boldsymbol{\mu}, \boldsymbol{\Sigma}, g_n)$ and $A$ is $m\times n$ matrix of rank $m\leq n$ and $\boldsymbol{b}$ is $m$ -dimensional column-vector, then

$\begin{align*} A\boldsymbol{Y}+\boldsymbol{b}\sim \text{E}_m(A\boldsymbol{\mu}+\boldsymbol{b}, A\boldsymbol{\Sigma}A^{T}, g_m). \end{align*}$

The following condition:

$\begin{align*} \int_{0}^{\infty}g_1(u)du < \infty \end{align*}$

guarantees the existence of the mean. If, in addition, $|\psi'(0)| < \infty$ , the covariance matrix exists and is equal to

$\begin{align*} \text{Cov}(\boldsymbol{Y}) = -\psi'(0)\boldsymbol{\Sigma}, \end{align*}$

(see ^[3]).

From (2.2) and (2.3), $g_{1}(x)$ can be a density generator of univariate elliptical distribution of the random variable $Y\sim \text{E}_{1}(\mu, \sigma^2, g_{1})$ whose $\text{pdf}$ can be expressed as

$\begin{align*} f_{Y}(y) = \frac{c}{\sigma}g_{1}\left(\frac{1}{2}\left(\frac{y-\mu}{\sigma}\right)^2\right), \end{align*}$

where $c$ is the normalizing constant. In this paper, we assume

$\begin{align} \text{Var}(Z) = \sigma_{Z}^{2} < \infty, \end{align}$

(2.4)

where $Z = \frac{Y-\mu}{\sigma}$ is the spherical random variable. The $\text{cdf}$ of the random variable $Z$ can be written as the following integration form:

$\begin{align*} F_{Z}(z) = c\int_{-\infty}^{z}g_{1}(\frac{1}{2}u^2)du. \end{align*}$

We can obtain the mean and variance of $Z$ :

$\begin{align*} \mu_{Z} = 0 \end{align*}$

and

$\begin{align*} \sigma_{Z}^{2} = 2c\int_{0}^{\infty}u^{2}g_{1}(\frac{1}{2}u^2)du = -\psi^{'}(0). \end{align*}$

Landsman and Valdez ^[17] showed that

$\begin{align} f_{Z^{*}}(z) = \frac{1}{\sigma_{Z}^{2}}\overline{G}(\frac{1}{2}z^{2}) \end{align}$

(2.5)

is the density of another spherical random variable $Z^{*}$ associated with $Z$ , where

$\begin{align*} \overline{G}(z) = \int_{z}^{\infty}g_{1}(u)du. \end{align*}$

The random vector $\boldsymbol X\sim$ LSME $_n(\boldsymbol{\mu}, \boldsymbol{\Sigma}, \boldsymbol{\gamma}, g_n;\Pi)$ has an $n$ -dimensional $\text{LSME}$ distribution with location parameter $\boldsymbol{\mu}$ , positive definite scale matrix $\boldsymbol{\Sigma}$ , if

$\begin{align} \boldsymbol X = \boldsymbol {m}(\Theta)+\Theta^{\frac{1}{2}}\boldsymbol{\Sigma}^{\frac{1}{2}}\boldsymbol{Y}, \end{align}$

(2.6)

in distribution, where

(1) $\boldsymbol Y \sim \text{E}_n(\boldsymbol0, \boldsymbol I_n, g_n)$ , the $n$ -dimensional multivariate elliptical variable;

(2) Non-negative scalar random variable $\Theta$ is independent of $\boldsymbol{Y}$ , whose $\text{pdf}$ and $\text{cdf}$ are $\pi(\theta)$ and $\Pi(\theta)$ respectively;

(3) $\boldsymbol {m}(\Theta) = \boldsymbol {\mu}+\Theta \boldsymbol {\gamma}$ , where $\boldsymbol {\mu} = (\mu_{1}, \ldots, \mu_n)^{T} \; \text{and} \; \boldsymbol {\gamma} = (\gamma_{1}, \ldots, \gamma_n)^{T}$ are constant vectors in $\Bbb R^{n}$ .

The pdf of the $\text{LSME}$ can be written as the following integration form:

$\begin{align*} \nonumber f_{\boldsymbol{X}}(\boldsymbol{x})& = \frac{c_n}{\sqrt{|\Sigma|}}\int_{0}^{\infty}\frac{1}{\sqrt{\theta}}g_n\left(\frac{1}{2\theta}(\boldsymbol{x}-\boldsymbol{\mu}-\theta\boldsymbol{\gamma})^{T}\boldsymbol{\Sigma}^{-1}(\boldsymbol{x}-\boldsymbol{\mu}-\theta\boldsymbol{\gamma})\right)\pi(\theta)d\theta, \; \; \; \boldsymbol{x}\in \Bbb R^{n}. \end{align*}$

We find that the conditional distribution of $\boldsymbol{X}|\theta$ is elliptical, that is

$\begin{align} \boldsymbol X|\Theta = \theta\; \; \sim\; \; \text{E}_n\left(\boldsymbol m(\theta), \theta \boldsymbol {\Sigma}, g_n\right). \end{align}$

(2.7)

We can obtain the mean and covariance of $\boldsymbol{X}$ :

$\begin{align*} \text{E}(\boldsymbol{X}) = \text{E}[ \text{E}(\boldsymbol{X}|\Theta)] = \text{E}\left(\boldsymbol{m}(\Theta)\right) = \boldsymbol{\mu}+ \text{E}(\Theta)\boldsymbol{\gamma} \end{align*}$

and

$\begin{align*} \nonumber \text{Cov}(\boldsymbol{X}) = & \text{E}\left[ \text{Var}(\boldsymbol{X}|\Theta)\right]+ \text{Var}\left[ \text{E}(\boldsymbol{X}|\Theta)\right] = \text{E}(\Theta \boldsymbol{\Sigma})+ \text{Var}\left(\boldsymbol{m}(\Theta)\right)\\ = & \text{E}(\Theta)\boldsymbol{\Sigma}+ \text{Var}(\Theta)\boldsymbol{\gamma} \boldsymbol{\gamma^{T}}. \end{align*}$

The characteristic function of $\boldsymbol{X}|\Theta = \theta$ exists and equals to

$\begin{align*} \varphi_{\boldsymbol{X}}(\boldsymbol{t}|\Theta = \theta) = \exp(i\boldsymbol{t}^{T}\boldsymbol{\mu})\exp(i\theta\boldsymbol{t}^{T}\boldsymbol{\gamma})\psi(\frac{1}{2}\theta\boldsymbol{t}^{T}\boldsymbol{\Sigma}\boldsymbol{t}). \end{align*}$

Then the characteristic function of the $\text{LSME}$ -distributed random vector $\boldsymbol{X}$ can be written as

$\begin{align} \varphi_{\boldsymbol{X}}(\boldsymbol{t}) = &\exp(i\boldsymbol{t}^{T}\boldsymbol{\mu}) \text{E}\left[\exp(i\theta\boldsymbol{t}^{T}\boldsymbol{\gamma})\psi(\frac{1}{2}\theta\boldsymbol{t}^{T}\boldsymbol{\Sigma}\boldsymbol{t})\right]\\ = &\exp(i\boldsymbol{t}^{T}\boldsymbol{\mu})\int_{0}^{\infty}\exp(i\theta\boldsymbol{t}^{T}\boldsymbol{\gamma})\psi(\frac{1}{2}\theta\boldsymbol{t}^{T}\boldsymbol{\Sigma}\boldsymbol{t})\pi(\theta)d\theta. \end{align}$

(2.8)

Under the condition (2.2) and from (2.5), we can conclude $\overline{G}(z)$ is the density generator of the associated elliptical variable $Z^*$ , then

$\begin{align} X^{*} = m(\Theta)+\sqrt{\Theta}\sigma{Z^{*}} \end{align}$

(2.9)

is said to have a univariate $\text{LSME}$ distributions, denoted by $X^{*}\sim \text{LSME}_{1}(\mu, \sigma^{2}, \gamma, \Theta, \overline{G}; \Pi)$ .

Proposition 1. If $\boldsymbol X\sim \text{LSME}_n(\boldsymbol{\mu}, \boldsymbol{\Sigma}, \boldsymbol{\gamma}, g_n; \Pi)$ and $\boldsymbol{Y} = B\boldsymbol{X}+\boldsymbol{b}$ where $B$ is $m\times n$ ( $m\leq n$ ) matrix and $\boldsymbol{b}$ is $m$ -dimensional column-vector, then it holds that $\boldsymbol Y\sim \text{LSME}_m(B\boldsymbol{\mu}+\boldsymbol{b}, B\boldsymbol{\Sigma}B^T, B\boldsymbol{\gamma}, g_m; \Pi).$

Proof. Using the characteristic function (2.8), we write

$\begin{align} \varphi_{\boldsymbol{Y}}&(\boldsymbol{t}) = \text{E}(e^{i\boldsymbol{t}^{T}(B\boldsymbol{X}+\boldsymbol{b})})\\ = &\exp(i\boldsymbol{t}^{T}\boldsymbol{b})\varphi_{\boldsymbol{X}}(B^{T}\boldsymbol{t})\\ = &\exp(i\boldsymbol{t}^{T}\boldsymbol{b})\exp\left(i(B^{T}\boldsymbol{t})^{T}\boldsymbol{\mu}\right)\int_{0}^{\infty}\exp\left(i\theta(B^{T}\boldsymbol{t})^{T}\boldsymbol{\gamma}\right)\psi\left(\frac{1}{2}\theta(B^{T}\boldsymbol{t})^{T}\boldsymbol{\Sigma}(B^{T}\boldsymbol{t})\right)\pi(\theta)d\theta\\ = &\exp\left(i\boldsymbol{t}^{T}(B\boldsymbol{\mu}+\boldsymbol{b})\right)\int_{0}^{\infty}\exp(i\theta\boldsymbol{t}^{T}B\boldsymbol{\gamma})\psi(\frac{1}{2}\theta\boldsymbol{t}^{T}B\boldsymbol{\Sigma}B^T\boldsymbol{t})\pi(\theta)d\theta, \end{align}$

(2.10)

i.e., $\boldsymbol Y\sim \text{LSME}_m(B\boldsymbol{\mu}+\boldsymbol{b}, B\boldsymbol{\Sigma}B^T, B\boldsymbol{\gamma}, g_m; \Pi).$

$\bf{Example\; 2.1}$ (Generalized hyper-elliptical distribution). The $\text{GHE}$ distribution is constructed by mixing a generalized inverse Gaussian distribution with elliptical distribution. A positive random variable $\Theta$ is said to have a generalized inverse Gaussian distribution, denoted by $\Theta \sim \text{GIG}(\lambda, a, b)$ , if its pdf is given by

$\begin{align} \pi(\theta;\lambda, a, b) = \frac{a^{-\lambda}(\sqrt{ab})^{\lambda}}{2K_{\lambda}(\sqrt{ab})}\theta^{\lambda -1}\exp\left(-\frac{1}{2}(a\theta^{-1}+b\theta)\right), \; \; \; \theta > 0, \end{align}$

(2.11)

where parameters follow

$\begin{align*} \left\{ \begin{array}{lcl} a\geq0 & \text{and}& b > 0, \; \; \text{if}\; \; \lambda > 0, \\ a > 0 & \text{and}& b\geq0, \; \; \text{if}\; \; \lambda < 0, \\ a > 0 & \text{and}& b > 0, \; \; \text{if}\; \; \lambda = 0 \end{array} \right. \end{align*}$

and $K_{\lambda}(\cdot)$ denotes the modified Bessel function of the second kind with index $\lambda\in\Bbb{R}$ . A random vector $\boldsymbol{X}\sim \text{GHE}_n(\boldsymbol{\mu}, \boldsymbol{\Sigma}, \boldsymbol{\gamma}, g_{n}, \lambda, a, b)$ has an $n$ -dimensional $\text{GHE}$ distribution, if there exists a random vector $\boldsymbol{Y}$ follows (2.6) such that

$\begin{align} \boldsymbol X = \boldsymbol {m}(\Theta)+\Theta^{\frac{1}{2}}\boldsymbol{\Sigma}^{\frac{1}{2}}\boldsymbol{Y}, \end{align}$

(2.12)

where $\Theta\sim \text{GIG}(\lambda, a, b)$ .

3. Tail conditional expectation for univariate case

The univariate $\text{LSME}$ variable is given by $n = 1$ in the multivariate definition. That is, the univariate $\text{LSME}$ variable $X\sim \text{LSME}_1(\mu, \sigma^2, \gamma, g_1;\Pi)$ satisfies

$\begin{align*} X = m(\Theta)+\sqrt\Theta \sigma Z, \end{align*}$

where $Z\sim \text{E}_1(0, 1, g_1)$ is the standard elliptical variable, and non-negative scalar random variable $\Theta$ with pdf $\pi(\theta)$ is independent of $Z$ . From (2.7), we have

$\begin{align} X|\theta \sim \text{E}_1\left(m(\theta), \theta \sigma^2, g_1\right). \end{align}$

(3.1)

Assuming that both the conditional distribution and the mixed distribution are continuous, the $\text{pdf}$ of $X$ produced by the mixed distribution can be written as

$\begin{align} f_X(x) = \int_{\Omega_{\theta}}f(x|\theta)\pi(\theta)d\theta, \end{align}$

(3.2)

where $f(x|\theta)$ is the $\text{pdf}$ of $X|\theta$ and $\Omega_{\theta}$ is the support of $\pi(\theta)$ . Now let $x_p$ be the quantile of the LSME variable $X$ . Then the TCE of $X$ can be expressed as

$\begin{align} \text{E}(&X|X > x_p)\\ & = \frac{1}{1-p}\int_{x_p}^{\infty}xf_{X}(x)dx\\ & = \frac{1}{1-p}\int_{\Omega_{\theta}}\int_{x_p}^{\infty}xf(x|\theta)dx\; \pi(\theta)d\theta\\ & = \frac{1}{1-p}\int_{\Omega_{\theta}} \text{E}_{X|\theta}(X|X > x_p)\overline{F}_{X|\theta}(x_p)\pi({\theta})d\theta. \end{align}$

(3.3)

The $\text{TCE}$ formula for a univariate elliptical distribution is introduced by ^[17], and equals to

$\begin{align} \text{TCE}_{p}(X|\theta) = & \text{E}_{X|\theta}(X|X > x_p)\\ = &m(\theta)+\frac{\frac{1}{\sqrt \theta \sigma}f_{Z^{\ast}}\left(\kappa(x_p;\theta)\right)}{\overline{F}_Z\left(\kappa(x_p;\theta)\right)}\sigma_{Z}^2 \theta \sigma^2\\ = &m(\theta)+\frac{\frac{1}{\sqrt \theta \sigma}f_{Z^{\ast}}\left(\kappa(x_p;\theta)\right)}{\overline{F}_Z\left(\kappa(x_p;\theta)\right)} \text{Var}(X|\theta), \end{align}$

(3.4)

where

$\begin{align*} \kappa(x;\theta) = \frac{x-m(\theta)}{\sqrt{\theta}\sigma}. \end{align*}$

We now give a general $\text{TCE}$ formula for the univariate $\text{LSME}$ distributions.

Theorem 1. Let $X\sim \text{LSME}_1(\mu, \sigma^2, \gamma, g_1;\Pi)$ and $\pi'(\theta) = (c')^{-1}\theta \pi(\theta)$ be a mixing $\text{pdf}$ with $c' = \text{E}(\Theta) < \infty$ . Then the $\text{TCE}$ of $X$ can be computed by:

$\begin{align} \text{E}&(X|X > x_p)\\ & = \mu+\frac{\gamma c'}{1-p}\overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_{1};\Pi')+\frac{c'\sigma^{2}\sigma_{Z}^{2}}{1-p}f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi'), \end{align}$

(3.5)

where $\Pi'$ is the $\text{cdf}$ corresponding to the $\text{pdf}$ $\pi'$ .

Proof. From (3.3) and (3.4), we have

$\begin{align} \text{E}(X|X > x_p) = &\frac{1}{1-p}\int_{0}^{\infty}\left[m(\theta)+\frac{\frac{1}{\sqrt \theta \sigma}f_{Z^{\ast}}\left(\kappa(x_p;\theta)\right)}{\overline{F}_Z\left(\kappa(x_p;\theta)\right)} \text{Var}(X|\theta)\right]\overline{F}_{X|\theta}(x_p)\pi({\theta})d\theta\\ = &\frac{1}{1-p}\int_{x_p}^{\infty}\int_{0}^{\infty}\frac{m(\theta)}{\sqrt{\theta}\sigma}f_{Z}\left(\kappa(x;\theta)\right)\pi(\theta)d\theta dx\\ &+\frac{\sigma^{2}\sigma_{Z}^{2}}{1-p}\int_{0}^{\infty}\frac{f_{Z^{\ast}}\left(\kappa(x_p;\theta)\right)}{\sqrt\theta \sigma}\theta\pi(\theta) d\theta\\ = &\frac{1}{1-p}\int_{x_p}^{\infty}\mu f(x)dx+\frac{\gamma}{1-p}\int_{x_p}^{\infty} \int_{0}^{\infty}f(x|\theta)\theta \pi(\theta)d\theta dx\\ & +\frac{\sigma^{2}\sigma_{Z}^{2}}{1-p}\int_{0}^{\infty}\frac{f_{Z^{\ast}}\left(\kappa(x_p;\theta)\right)}{\sqrt\theta \sigma}\theta\pi(\theta) d\theta. \end{align}$

(3.6)

From the definition of $\text{LSME}$ distributions and (3.2), we have

$\begin{align*} \int_{0}^{\infty}f(x|\theta)\theta \pi(\theta) d\theta = c'\int_{0}^{\infty}f(x|\theta)\pi'(\theta)d\theta = c'f_{ \text{LSME}, 1}(x;\mu, \sigma^{2}, \gamma, g_1;\Pi'). \end{align*}$

As a result, (3.6) can be further simplified

$\begin{align*} \nonumber \text{E}(&X|X > x_p)\\ \nonumber & = \mu+\frac{\gamma c'}{1-p}\int_{x_p}^{\infty} \int_{0}^{\infty}f(x|\theta) \pi'(\theta)d\theta dx+\frac{c'\sigma^{2}\sigma_{Z}^{2}}{1-p}\int_{0}^{\infty}\frac{f_{Z^{\ast}}\left(\kappa(x_p;\theta)\right)}{\sqrt\theta \sigma}\pi'(\theta) d\theta\\ \nonumber & = \mu+\frac{\gamma c'}{1-p}\int_{x_p}^{\infty}f_{ \text{LSME}, 1}(x;\mu, \sigma^{2}, \gamma, g_1;\Pi')dx+\frac{c'\sigma^{2}\sigma_{Z}^{2}}{1-p}\int_{0}^{\infty}\frac{f_{Z^{\ast}}\left(\kappa(x_p;\theta)\right)}{\sqrt\theta \sigma}\pi'(\theta) d\theta\\ \nonumber & = \mu+\frac{\gamma c'}{1-p}\overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_{1};\Pi')+\frac{c'\sigma^{2}\sigma_{Z}^{2}}{1-p} f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi') . \end{align*}$

Corollary 1. Let $X\sim \text{GHE}_1(\mu, \sigma^2, \gamma, g_1, \lambda, a, b)$ . Assume the conditions in Theorem 1 are satisfied, then the $\text{TCE}$ of $\text{GHE}$ can be computed by:

$\begin{align} \text{TCE}_{p}(X) = &\mu+\frac{\gamma }{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\overline{F}_{ \text{GHE}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_{1}, \lambda+1, a, b)\\ &+\frac{\sigma^{2}\sigma_{Z}^{2}}{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}f_{ \text{GHE}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G}, \lambda+1, a, b). \end{align}$

(3.7)

Proof. From the GIG density in (2.11), we conclude

$\begin{align*} \theta\pi(\theta;\lambda, a, b) = \sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\pi(\theta;\lambda+1, a, b), \end{align*}$

by setting

$\begin{align*} c' = \sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}, \end{align*}$

then

$\begin{align*} \pi'(\theta) = (c')^{-1}\theta\pi(\theta) = \pi(\theta;\lambda+1, a, b), \end{align*}$

which also is the GIG density. Using (3.5) we can directly obtain (3.7).

4. Portfolio risk decomposition with ${TCE}$ for multivariate ${LSME}$

Consider a risk vector $\boldsymbol Y = (Y_1, \ldots, Y_n)^{T}$ and $S = Y_1 +\ldots+Y_n$ . We denote $s_p$ as the $p$ -quantile of $S$ , then

$\begin{align*} \text{E}(S|S > s_p) = \sum\limits_{i = 1}^{n} \text{E}(Y_i|S > s_p), \end{align*}$

where $\text{E}(Y_i|S > s_p)$ is the contribution of the $i$ -th risk to the aggregated risks.

Let $\boldsymbol Y = (Y_1, \ldots, Y_n)\sim \text{E}_n(\boldsymbol{\mu}, \boldsymbol{\Sigma}, g_n)$ and $S = Y_1 +\ldots+Y_n$ , then (^[6])

$\begin{align*} \text{E}(Y_{i}|S = s)& = \int_{-\infty}^{\infty}y_if(y_i|s)dy_i\\ & = \text{E}(Y_i)+\frac{ \text{Cov}(Y_i, \; S)}{ \text{Var}(S)}\left(s- \text{E}(S)\right). \end{align*}$

The contribution of risk $Y_i, \; 1\leq i \leq n,$ to the total $\text{TCE}$ can be expressed as

$\begin{align*} \nonumber \text{E}(Y_{i}|S > s_p)& = \int_{s_p}^{\infty} \text{E}(Y_{i}|S = s)dF_S(s|S > s_p)\\ \nonumber & = \int_{s_p}^{\infty} \text{E}(Y_{i}|S = s)\frac{f_S(s)}{1-F_S(s_p)}ds\\ & = \frac{1}{1-p}\int_{s_p}^{\infty} \text{E}(Y_{i}|S = s)f_S(s)ds. \end{align*}$

We now exploit this formulation to the multivariate $\text{LSME}$ to obtain its portfolio risk decomposition with $\text{TCE}$ .

Let us assume $\boldsymbol{X} = (X_1, \ldots, X_n)^{T}\sim \text{LSME}_{n}(\boldsymbol{\mu}, \boldsymbol{\Sigma}, \boldsymbol{\gamma}, g_{n}; \Pi)$ . Denote the $(i, j)$ element of $\boldsymbol{\Sigma}$ by $\sigma_{ij}$ , define

$\begin{align*} S = X_1 +\ldots+X_n. \end{align*}$

Then, $\text{E}(X_i|S = s)$ can be further expanded by conditioning on $\theta$ as follows:

$\begin{align} \text{E}(X_i|S = s)& = \int_{-\infty}^{\infty}x_if(x_i|s)dx_i = \frac{\int_{-\infty}^{\infty}x_if(x_{i}s)dx_i}{f_S(s)}\\ & = \frac{1}{f_S(s)}\int_{-\infty}^{\infty}x_i\int_{0}^{\infty}f(x_i, s|\theta)d\pi(\theta) dx_i\\ & = \frac{1}{f_S(s)}\int_{-\infty}^{\infty}x_i\int_{0}^{\infty}f(x_i|s, \theta)f(s|\theta)\pi(\theta)d\theta dx_i\\ & = \frac{1}{f_S(s)}\int_{0}^{\infty}\left[\int_{-\infty}^{\infty}x_if(x_i|s, \theta)dx_i\right]f(s|\theta)\pi(\theta)d\theta. \end{align}$

(4.1)

To deal with the inner integral, we define a matrix $B_{i}$ of size $2\times n$ :

$\begin{align} B_{i} = \left[ \begin{array}{cccccccc} 0& 0&\ldots &0&1&0&\ldots&0 \\ 1& 1&\ldots&\ldots&\ldots&\ldots&\ldots&1 \end{array} \right ] . \end{align}$

(4.2)

The first row vector has 1 in the $i$ th position. If we keep the general form

$\begin{align*} \boldsymbol{m}(\theta) = (m_1(\theta), \ldots, m_n(\theta))^{T}, \end{align*}$

we have

$B_{i}\boldsymbol X|\theta = (X_i, S|\theta)^{T} = B_{i}\boldsymbol{m}(\theta)+\theta^{\frac{1}{2}}B_{i}\boldsymbol{\Sigma}^{\frac{1}{2}}\boldsymbol{Y},$

here $(X_i, S|\theta)^{T}$ stands for a random column vector of size $2\times 1$ , with each element being $X_i|\theta$ and $S|\theta$ , respectively. Thus, the joint distribution of $(X_i, S)$ under the condition of $\Theta = \theta$ is a bivariate elliptical distribution

$(X_i, S|\theta)^{T}\sim \text{E}_2\left(B_{i}\boldsymbol{m}(\theta), \theta B_{i}\Sigma B_{i}^{T}, g_2\right),$

where the mean vector and convariance matrix of $(X_i, S|\theta)$ are given by

$\begin{align} \text{E}(B_{i}\boldsymbol{X}|\theta) = B_{i}\boldsymbol{m}(\theta) = \left( \text{E}(X_i|\theta), \; \text{E}(S|\theta)\right)^{T} = \left[ \begin{array}{c} m_i(\theta)\\ \nonumber \sum_{j = 1}^{n}m_j(\theta) \end{array} \right], \end{align}$

$\begin{align} \text{Cov}(B_{i}\boldsymbol{X}|\theta) = &-\psi^{'}(0)\theta B_{i}\Sigma B_{i}^{T}\\ \\ = &-\psi'(0)\left[ \begin{array}{cc} \theta \sigma_{ii}&\theta \sum_{j = 1}^{n}\sigma_{ij}\\ \theta \sum_{j = 1}^{n}\sigma_{ij}&\theta \sigma_S^2\\ \end{array} \right], \end{align}$

where

$\begin{align*} \sigma_S^{2} = \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1} = \sum\limits_{i = 1}^{n}\sum\limits_{j = 1}^{n}\sigma_{ij}. \end{align*}$

Therefore, if we impose another condition on $S$ , we see that $f(x_i|s, \theta)$ is an elliptical density. In particular

$\begin{align*} \nonumber\int_{-\infty}^{\infty}x_if(x_i|s, \theta)dx_i = & \text{E}(X_i|S = s, \; \Theta = \theta)\\ \nonumber = & \text{E}(X_i|\theta)+\frac{ \text{Cov}(X_i, \; S|\theta)}{ \text{Var}(S|\theta)}\left(s- \text{E}(S|\theta)\right)\\ \nonumber = &m_i(\theta)+\frac{-\psi'(0)\sum_{j = 1}^{n}\sigma_{ij}}{-\psi'(0)\sigma_{S}^2}\left(s-\sum\limits_{j = 1}^{n}m_j(\theta)\right)\\ = &m_i(\theta)+\frac{\sum_{j = 1}^{d}\sigma_{ij}}{\sigma_{S}^2}\left(s-\sum\limits_{j = 1}^{n}m_j(\theta)\right). \end{align*}$

Consequently

$\begin{align} \text{E}(X_i|S = s) = &\frac{1}{f_S(s)}\int_{0}^{\infty}\left[\int_{-\infty}^{\infty}x_if(x_i|s, \theta)dx_i\right]f(s|\theta)\pi(\theta)d\theta\\ = &\frac{1}{f_S(s)}\int_{0}^{\infty}\left[m_i(\theta)+\frac{\sum_{j = 1}^{n}\sigma_{ij}}{\sigma_{S}^2}\left(s-\sum\limits_{j = 1}^{n}m_j(\theta)\right)\right]\\ &\times\frac{1}{\sqrt\theta\sigma_S}f_Z\left(\frac{s-\sum_{j = 1}^{n}{m_j(\theta)}}{\sqrt\theta\sigma_S}\right)\pi(\theta)d\theta. \end{align}$

(4.3)

Eventually

$\begin{align} \text{E}(X_i|S > s_p) = &\frac{1}{1-p}\int_{s_p}^{\infty} \text{E}(X_i|S = s)f_S(s)ds\\ = &\frac{1}{1-p}\int_{s_p}^{\infty}\int_{0}^{\infty}\left[m_i(\theta)+\frac{\sum_{j = 1}^{n}\sigma_{ij}}{\sigma_{S}^2}\left(s-\sum\limits_{j = 1}^{n}m_j(\theta)\right)\right]\\ &\times\frac{1}{\sqrt\theta\sigma_S}f_Z\left(\frac{s-\sum_{j = 1}^{n}{m_j(\theta)}}{\sqrt\theta\sigma_S}\right)\pi(\theta)d\theta ds. \end{align}$

(4.4)

This expression, though complex, can produce a closed-form quantity to properly select $\pi{(\theta)}$ and $m_j(\theta)$ .

The portfolio risk decomposition with $\text{TCE}$ is additive, that is, the sum of all portfolio risk decomposition must amount to the $\text{TCE}$ for $S$ . We can verify this

$\begin{align*} \nonumber \sum\limits_{i = 1}^{n}& \text{E}(X_i|S > s_p)\\ \nonumber& = \frac{1}{1-p}\sum\limits_{i = 1}^{n}\int_{s_p}^{\infty}\int_{0}^{\infty}\left[m_i(\theta)+\frac{\sum_{j = 1}^{n}\sigma_{ij}}{\sigma_{S}^2}\left(s-\sum\limits_{j = 1}^{n}m_j(\theta)\right)\right]\frac{1}{\sqrt\theta\sigma_S}f_Z\left(\frac{s-\sum_{j = 1}^{n}{m_j(\theta)}}{\sqrt\theta\sigma_S}\right)\pi(\theta)d\theta ds\\ \nonumber & = \frac{1}{1-p}\int_{s_p}^{\infty}\int_{0}^{\infty}\left[\sum\limits_{i = 1}^{n}m_i(\theta)+\frac{\sum_{i = 1}^{n}\sum_{j = 1}^{n}\sigma_{ij}}{\sigma_{S}^2}\left(s-\sum\limits_{j = 1}^{n}m_j(\theta)\right)\right]\frac{1}{\sqrt\theta\sigma_S}f_Z\left(\frac{s-\sum_{j = 1}^{n}{m_j(\theta)}}{\sqrt\theta\sigma_S}\right)\pi(\theta)d\theta ds\\ \nonumber & = \frac{1}{1-p}\int_{s_p}^{\infty}\int_{0}^{\infty}\left[\sum\limits_{i = 1}^{n}m_i(\theta)+\left(s-\sum\limits_{j = 1}^{n}m_j(\theta)\right)\right]\frac{1}{\sqrt\theta\sigma_S}f_Z\left(\frac{s-\sum_{j = 1}^{n}{m_j(\theta)}}{\sqrt\theta\sigma_S}\right)\pi(\theta)d\theta ds\\ \nonumber & = \frac{1}{1-p}\int_{s_p}^{\infty}\int_{0}^{\infty}s\frac{1}{\sqrt\theta\sigma_S}f_Z\left(\frac{s-\sum_{j = 1}^{n}{m_j(\theta)}}{\sqrt\theta\sigma_S}\right)\pi(\theta)d\theta ds\\ & = \text{E}(S|S > s_p), \end{align*}$

as required. Now the general portfolio risk decomposition with $\text{TCE}$ formula for the multivariate $\text{LSME}$ distributions class in presented is a more concrete and compact manner when $\boldsymbol m{(\theta)}$ is linear in $\theta$ .

Theorem 2. Let $\boldsymbol{X} = (X_{1}, X_{2}, \cdots, X_{n})^{T}\sim \text{LSME}_n(\boldsymbol{\mu}, \boldsymbol{\Sigma}, \boldsymbol{\gamma}, g_n; \Pi)$ and denote the $\text{pdf}$ of $S = \boldsymbol{1}^{T}\boldsymbol{X}$ by $f_S(s)$ . Let $\pi'(\theta) = (c')^{-1}\theta \pi(\theta)$ be a mixing $\text{pdf}$ with $c' = \text{E}(\Theta) < \infty$ .

Then the portfolio risk decomposition with $\text{TCE}$ for the $i$ - $\text{th}$ marginal variable is given by

$\begin{align} \text{E}(&X_i|S > s_p)\\ & = b_{0, i}+b_{1, i} \text{E}(S|S > s_p)+\frac{b_{2, i}}{1-p}c'\overline{F}_{ \text{LSME}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1};\Pi'), \end{align}$

(4.5)

where $\Pi'$ is the $\text{cdf}$ corresponding to the $\text{pdf}$ $\pi'$ , the coefficients $b_{0, i}, \; b_{1, i}$ , and $b_{2, i}$ are defined as

$\begin{align*} b_{0, i} = \mu_i -b_{1, i}\sum\limits_{j = 1}^{n}\mu_j ;\; \; \; \; b_{1, i} = \frac{\sum\limits_{j = 1}^{n}\sigma_{ij}}{\sigma_S^2} ;\; \; \; \; b_{2, i} = \gamma_i-b_{1, i}\sum\limits_{j = 1}^{n}\gamma_j, \end{align*}$

and $s_p$ is the $p$ -quantile of $S$ .

Proof. Let $m_i(\theta) = \mu_i+\theta \gamma_i$ , and from (4.3) we have

$\begin{align*} \nonumber \text{E}(&X_i|S = s)\\ \nonumber & = \frac{1}{f_S(s)}\int_{0}^{\infty}\left[\mu_i+\frac{\sum_{j = 1}^{n}\sigma_{ij}}{\sigma_S^2}(s-\sum\limits_{j = 1}^{n}\mu_j)+\left(\gamma_i -\sum\limits_{j = 1}^{n}\gamma_j \frac{\sum_{j = 1}^{n}\sigma_{ij}}{\sigma_S^2}\right)\theta\right]\\ \nonumber&\; \; \; \times\frac{1}{\sqrt{\theta}\sigma_S }f_Z\left(\frac{s-\sum_{j = 1}^{n}\mu_j -\theta\sum_{j = 1}^{n}\gamma_{j}}{\sqrt\theta \sigma_S}\right)\pi(\theta)d\theta\\ \nonumber & = \frac{1}{f_S(s)}\int_{0}^{\infty}\left[b_{0, i}+b_{1, i}s+b_{2, i}\theta\right]\frac{1}{\sqrt{\theta}\sigma_S}f_Z\left(\frac{s-\sum_{j = 1}^{n}\mu_j -\theta\sum_{j = 1}^{n}\gamma_{j}}{\sqrt\theta \sigma_S}\right)\pi(\theta)d\theta\\ \nonumber & = \frac{1}{f_S(s)}\left[b_{0, i}f_S(s)+b_{1, i}sf_S(s)+b_{2, i}c'f_{ \text{LSME}, 1}(s_{p};\boldsymbol{1}^{T}\mu, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1};\Pi')\right]\\ & = b_{0, i}+b_{1, i}s+b_{2, i}c'\frac{f_{ \text{LSME}, 1}(s_{p};\boldsymbol{1}^{T}\mu, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1};\Pi')}{f_S(s)}. \end{align*}$

By inserting this into the portfolio risk decomposition with $\text{TCE}$ formulation (4.4), we complete the proof as

$\begin{align} \text{E}(X_i&|S > s_p) = \int_{s_p}^{\infty} \text{E}(X_i|S = s)f(s|S > s_p)ds\\ & = \int_{s_p}^{\infty} \text{E}(X_i|S = s)\frac{f_S(s)}{1-p}ds\\ & = \frac{1}{1-p}\int_{s_p}^{\infty}(b_{0, i}+b_{1, i}s)f_S(s)ds+\frac{1}{1-p}b_{2, i}c'\int_{s_p}^{\infty}f_{ \text{LSME}, 1}(s;\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1};\Pi')ds\\ & = b_{0, i}+b_{1, i} \text{E}(S|S > s_p)+\frac{b_{2, i}}{1-p}c'\overline{F}_{ \text{LSME}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1};\Pi'). \end{align}$

(4.6)

Notice that $\sum_{i = 1}^{n}b_{0, i} = 0, \; \sum_{i = 1}^{n}b_{1, i} = 1$ , and $\sum_{i = 1}^{n}b_{2, i} = 0$ , which can be used to verify that the sum of these portfolio risk decomposition amounts to $\text{E}(S|S > s_p)$ .

Corollary 2. Let $\boldsymbol{X} = (X_{1}, X_{2}, \cdots, X_{n})^{T}\sim \text{GHE}_{n}(\boldsymbol{\mu}, \boldsymbol{\Sigma}, \boldsymbol{\gamma}, g_n, \lambda, a, b)$ . The portfolio risk decomposition with $\text{TCE}$ for the $i$ - $\text{th}$ marginal variable is given by

$\begin{align} \text{E}&(X_i|S > s_p)\\ & = \mu_{i}+\frac{\sigma_{Z}^{2}\sum\limits_{j = 1}^{n}\sigma_{ij}}{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}f_{ \text{GHE}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, \overline{G};\lambda+1, a, b)\\ &\; \; \; +\frac{\gamma_{i} }{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\overline{F}_{ \text{GHE}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1}, \lambda+1, a, b). \end{align}$

(4.7)

Proof. We can know $S\sim \text{GHE}_{1} (s_{p}; \boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1}, \lambda+1, a, b)$ by using Proposition 1. Using (4.5), we see that $\text{TCE}$ of $S$ is given by

$\begin{align*} \nonumber \text{E}&(S|S > s_p)\\ \nonumber& = \mu_{i}+\frac{\sigma_{Z}^{2}\sum\limits_{j = 1}^{n}\sigma_{ij}}{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}f_{ \text{GHE}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, \overline{G}, \lambda+1, a, b)\\ &\; \; \; +\frac{\gamma_{i} }{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\overline{F}_{ \text{GHE}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1}, \lambda+1, a, b). \end{align*}$

Therefore

$\begin{align*} \nonumber \text{E}&(X_i|S > s_p)\\ \nonumber& = \mu_{i}+b_{1, i}\frac{\sigma_{S}^{2}\sigma_{Z}^{2}}{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}f_{ \text{GHE}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, \overline{G}, \lambda+1, a, b)\\ \nonumber&\; \; \; +\frac{\gamma_{i} }{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\overline{F}_{ \text{GHE}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1}, \lambda+1, a, b)\\ \nonumber& = \mu_{i}+\frac{\sigma_{Z}^{2}\sum_{j = 1}^{n}\sigma_{ij}}{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}f_{ \text{GHE}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, \overline{G}, \lambda+1, a, b)\\ &\; \; \; +\frac{\gamma_{i} }{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\overline{F}_{ \text{GHE}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, g_{1}, \lambda+1, a, b). \end{align*}$

5. Tail variance for univariate case

The $\text{TV}$ of the univariate elliptical distribution is introduced by ^[8]. From (3.1), We can write the $\text{TV}$ for $X|\theta$ as

$\begin{align*} \label{(23)} \nonumber \text{TV}_p(X|\theta) = & \text{Var}(X|\theta)\left[r\left(\kappa(x_p;\theta)\right)+h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\left(\kappa(x_p;\theta)-h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\sigma_{Z}^2\right)\right], \end{align*}$

where $\kappa(x; \theta)$ is the same as in (3.4),

$\begin{align*} r(z) = \frac{\overline{F}_{Z^*}(z)}{\overline{F}_{Z}(z)} \end{align*}$

is the distorted ratio function, and

$\begin{align*} h_{Z, Z^*}(z) = \frac{f_{Z^*}(z)}{\overline{F}_{Z}(z)} \end{align*}$

is the distorted hazard function.

$\text{TV}$ can be rewritten as:

$\begin{align} \text{TV}_p(X) = & \text{Var}(X|X > x_p) = \text{E}\left[\left(X- \text{TCE}_p(X)\right)^2|X > x_p\right]\\ = & \text{E}(X^2|X > x_p)-\left[ \text{TCE}_p(X)\right]^2. \end{align}$

(5.1)

Consequently, we need to derive the second order conditional tail moment $\text{E}(X^2|X > x_p)$ . We now provide its analytic expression in the following result.

Proposition 2. Assume a random variable $X\sim \text{LSME}_1(\mu, \sigma^2, \gamma, g_1;\Pi)$ . Let $\pi'(\theta) = (c')^{-1}\theta \pi(\theta)$ and $\pi''(\theta) = (c'')^{-1}\theta^2 \pi(\theta)$ be two different mixing $\text{pdf}$ s with $c' = \text{E}(\Theta) < \infty$ and $c'' = \text{E}(\Theta^2) < \infty$ respectively. Then

$\begin{align} \text{E}&(X^2|X > x_p)\\ & = \frac{\sigma^2\sigma_{Z}^{2}}{1-p}\left[(x_p+\mu)c'f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi')+\gamma c'' f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi'')\right.\\ &\; \; \; + \left. c'\overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi')\right]+\frac{\gamma}{1-p}\left[ 2\mu c' \overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_1;\Pi')\right.\\ &\; \; \; + \left. \gamma c'' \overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_1;\Pi'')\right]+\mu^2, \end{align}$

(5.2)

where $\Pi'$ and $\Pi''$ are two $\text{cdf}$ s corresponding to the two different $\text{pdf}$ s $\pi'$ and $\pi''$ , respectively.

Proof. From observing

$\begin{align*} \nonumber \text{E}(X^2|X > x_p) = &\frac{1}{1-p}\int_{x_p}^{\infty}x^2f_X(x)dx\\ \nonumber = &\frac{1}{1-p}\int_{0}^{\infty}\int_{x_p}^{\infty}x^2f_{X|\theta}(x)\pi(\theta)dxd\theta\\ = &\frac{1}{1-p}\int_{0}^{\infty} \text{E}_{X|\theta}(X^2|X > x_p)\overline{F}_{X|\theta}(x_p)\pi(\theta)d\theta. \end{align*}$

To deal with the second order conditional tail moment in the integration, we write it as

$\begin{align} \text{E}_{X|\theta}(X^2|X > x_p) = \text{TV}_{p}(X|\theta)+\left[ \text{TCE}_p(X|\theta)\right]^2. \end{align}$

(5.3)

From ^[17], we know

$\begin{align*} \text{TCE}_p(X|\theta) = m(\theta)+h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\frac{ \text{Var}(X|\theta)}{\sqrt\theta \sigma}, \end{align*}$

taking $\text{Var}(X|\theta) = \theta \sigma^2 \sigma_{Z}^{2}$ into consideration, then (5.3) becomes

$\begin{align*} \text{E}_{X|\theta}(X^2|X > x_p) = & \text{Var}(X|\theta)\left[r\left(\kappa(x_p;\theta)\right)+h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\left(\kappa(x_p;\theta)-h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\sigma_{Z}^2\right)\right]\\ &+\left(m(\theta)+h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\frac{ \text{Var}(X|\theta)}{\sqrt\theta \sigma}\right)^2\\ = & \text{Var}(X|\theta)r\left(\kappa(x_p;\theta)\right)+ \text{Var}(X|\theta)h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\frac{x_p-m(\theta)}{\sqrt{\theta}\sigma}\\ &- \text{Var}(X|\theta)\left(h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\right)^{2}\sigma_{Z}^2+m^{2}(\theta)+2m(\theta)h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\frac{ \text{Var}(X|\theta)}{\sqrt\theta \sigma}\\ &+\left(h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\right)^2\frac{\theta \sigma^2 \sigma_{Z}^{2} \text{Var}(X|\theta)}{\theta \sigma^2}\\ = &m^{2}(\theta)+ \text{Var}(X|\theta)\left(r\left(\kappa(x_p;\theta)\right)+ h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)\frac{x_p+m(\theta)}{\sqrt{\theta}\sigma}\right). \end{align*}$

As a result

$\begin{align*} \nonumber \text{E}&(X^2|X > x_p)\\ \nonumber & = \frac{1}{1-p}\int_{0}^{\infty} \text{E}_{X|\theta}(X^2|X > x_p)\overline{F}_{X|\theta}(x_p)\pi(\theta)d\theta\\ \nonumber & = \frac{1}{1-p}\int_{0}^{\infty} \text{Var}(X|\theta)\left[\frac{x_p+m(\theta)}{\sqrt\theta \sigma}h_{Z, Z^*}\left(\kappa(x_p;\theta)\right)+r\left(\kappa(x_p;\theta)\right)\right]\\ \nonumber &\; \; \; \times\overline{F}_{X|\theta}(x_p)\pi(\theta)d\theta+\frac{1}{1-p}\int_{0}^{\infty}m^2(\theta)\overline{F}_{X|\theta}(x_p)\pi(\theta)d\theta\\ \nonumber & = \frac{1}{1-p}\int_{0}^{\infty} \text{Var}(X|\theta)\left[\frac{x_p+m(\theta)}{\sqrt\theta \sigma}f_{Z^*}\left(\kappa(x_p;\theta)\right)+\overline{F}_{Z^*}\left(\kappa(x_p;\theta)\right)\right]\\ \nonumber &\; \; \; \times\pi(\theta)d\theta+\frac{1}{1-p}\int_{0}^{\infty}\overline{F}_{X|\theta}(x_p)\pi(\theta)\left(\mu^2+2\mu\theta\gamma+\theta^2 \gamma^2\right)d\theta\\ \nonumber & = \frac{\sigma^2\sigma_{Z}^{2}}{1-p}\left[(x_p+\mu)c'f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi')+\gamma c'' f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi'')\right.\\ \nonumber&\; \; \; + \left. c'\overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi')\right]+\frac{\gamma}{1-p}\left[ 2\mu c' \overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_1;\Pi')\right.\\ &\; \; \; + \left. \gamma c'' \overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_1;\Pi'')\right]+\mu^2. \end{align*}$

Theorem 3. Under assumptions of Proposition 2, the $\text{TV}$ of $X$ is given by

$\begin{align} \text{TV}_p&(X)\\ = &\frac{\sigma^2\sigma_{Z}^{2}}{1-p}\left[(x_p-\mu)c'f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi')+\gamma c'' f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi'')\right.\\ &+ \left. c'\overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi')\right]+\frac{\gamma^2}{1-p}c'' \overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_1;\Pi'')\\ &-\left(\frac{c'}{1-p}\right)^{2}\left[\gamma\overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_1;\Pi') +\sigma^2 \sigma_{Z}^2 f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi')\right]^{2}, \end{align}$

(5.4)

where $\Pi'$ and $\Pi''$ are two $\text{cdf}$ s corresponding to the two different $\text{pdf}$ s $\pi'$ and $\pi''$ , respectively.

Proof. From the Theorem 1, the $\text{TCE}$ formula is

$\begin{align*} \nonumber \text{E}&(X|X > x_p)\\ & = \mu+\frac{\gamma c'}{1-p}\overline{F}_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_{1};\Pi')+\frac{c'\sigma^{2}\sigma_{Z}^{2}}{1-p}f_{ \text{LSME}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\Pi'). \end{align*}$

Hence, the result can be derived by using Proposition 2 and (5.1).

Corollary 3. Let $X\sim \text{GHE}_1(\mu, \sigma^2, \gamma, g_1, \lambda, a, b)$ . Assume the conditions in Theorem 3 are satisfied, then the $\text{TV}$ for $\text{GHE}$ can be computed by:

$\begin{align} \text{TV}_p(X) = &\frac{\sigma^2\sigma_{Z}^{2}}{1-p}\left[(x_p-\mu)\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}f_{ \text{GHE}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\lambda+1, a, b)\right.\\ &+ \left. \gamma \frac{a}{b}\frac{K_{\lambda+2}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})} f_{ \text{GHE}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\lambda+2, a, b)\right.\\ &+ \left. \sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\overline{F}_{ \text{GHE}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\lambda+1, a, b)\right]\\ &+\frac{\gamma^2}{1-p}\frac{a}{b}\frac{K_{\lambda+2}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})} \overline{F}_{ \text{GHE}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_1, \lambda+2, a, b)\\ &-\left(\frac{1}{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\right)^2\left[\gamma\overline{F}_{ \text{GHE}, 1}(x_{p};\mu, \sigma^{2}, \gamma, g_1, \lambda+1, a, b) \right.\\ &+ \left.\sigma^2 \sigma_{Z}^2 f_{ \text{GHE}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \overline{G};\lambda+1, a, b)\right]^2. \end{align}$

(5.5)

Proof. From the GIG density in (2.11), we conclude

$\begin{align*} \theta\pi(\theta;\lambda, a, b) = \sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\pi(\theta;\lambda+1, a, b) \end{align*}$

and

$\begin{align*} \theta^2\pi(\theta;\lambda, a, b) = \frac{a}{b}\frac{K_{\lambda+2}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\pi(\theta;\lambda+2, a, b). \end{align*}$

By setting

$\begin{align*} c' = \sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}, \quad c'' = \frac{a}{b}\frac{K_{\lambda+2}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}, \end{align*}$

the two $\text{pdf}$ s can be presented as

$\begin{align*} \pi'(\theta) = (c')^{-1}\theta\pi(\theta) = \pi(\theta;\lambda+1, a, b) \end{align*}$

and

$\begin{align*} \pi''(\theta) = (c'')^{-1}\theta^2\pi(\theta) = \pi(\theta;\lambda+2, a, b), \end{align*}$

which also are two GIG $\text{pdf}$ s. Using (5.4) we can directly obtain (5.5).

6. Examples

$\bf{Example\; 6.1}$ (Generalized hyperbolic distribution). If $\boldsymbol{\mu} = \boldsymbol{0}$ , $\boldsymbol{\Sigma} = \boldsymbol{I_n}$ and density generator $g(u) = e^{-u}$ in (2.2), then the elliptical vector $\boldsymbol{Y}$ is said to have a multivariate normal distribution, denoted by $\boldsymbol{Y}\sim \text{N}_n(\boldsymbol{0}, \boldsymbol{I_n})$ . Letting $\boldsymbol{Y}\sim \text{N}_n(\boldsymbol{0}, \boldsymbol{I_n})$ in (2.6), then the random vector $\boldsymbol{X}\sim \text{GH}_n(\boldsymbol{\mu}, \boldsymbol{\Sigma}, \boldsymbol{\gamma}, \lambda, a, b)$ is an $n$ -dimensional generalized hyperbolic ( $\text{GH}$ ) distribution. Therefore, the pdf of the $\text{GH}$ distribution is (see ^[15])

$\begin{align*} \nonumber f_{ \text{GH}_n}&(\boldsymbol{x}, \boldsymbol{\mu}, \boldsymbol{\Sigma}, \boldsymbol{\gamma}, \lambda, a, b)\\ = &c\frac{K_{\lambda-(n/2)}\left(\sqrt{(a+(\boldsymbol{x}-\boldsymbol{\mu})^{T}\boldsymbol{\Sigma}^{-1}(\boldsymbol{x}-\boldsymbol{\mu}))(b+\boldsymbol{\gamma}^{T}\boldsymbol{\Sigma}^{-1}\boldsymbol{\gamma})}\right)e^{(\boldsymbol{x}-\boldsymbol{\mu})^{T}\boldsymbol{\Sigma}^{-1}\boldsymbol{\gamma}}}{\left(a+(\boldsymbol{x}-\boldsymbol{\mu})^{T}\boldsymbol{\Sigma}^{-1}(\boldsymbol{x}-\boldsymbol{\mu}))(b+\boldsymbol{\gamma}^{T}\boldsymbol{\Sigma}^{-1}\boldsymbol{\gamma})\right)^{\frac{n}{4}-\frac{\lambda}{2}}}, \end{align*}$

where the normalizing constant is

$\begin{align*} c = \frac{\left(a b\right)^{-\frac{\lambda}{2}}b^{\lambda}(b+\boldsymbol{\gamma}^{T}\boldsymbol{\Sigma}^{-1}\boldsymbol{\gamma})^{(n/2)-\lambda}}{(2\pi)^{n/2}|\boldsymbol{\Sigma}|^{1/2}K_{\lambda}\sqrt{ab}}. \end{align*}$

From Corollary 1, $\text{TCE}$ of $\text{GH}$ distribution is given by

$\begin{align*} \nonumber \text{TCE}_{p}(X) = &\mu+\frac{\gamma }{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\overline{F}_{ \text{GH}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \lambda+1, a, b)\\ &+\frac{\sigma^{2}}{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}f_{ \text{GH}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \lambda+1, a, b). \end{align*}$

From Corollary 2, portfolio risk decomposition with $\text{TCE}$ for the $i$ -th marginal of $\text{GH}$ distribution is given by

$\begin{align*} \nonumber \text{E}(&X_i|S > s_p)\\ \nonumber& = \mu_{i}+\frac{\sum\limits_{j = 1}^{n}\sigma_{ij}}{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}f_{ \text{GH}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, \lambda+1, a, b)\\ &\; \; \; +\frac{\gamma_{i} }{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\overline{F}_{ \text{GH}, 1}(s_{p};\boldsymbol{1}^{T}\boldsymbol{\mu}, \boldsymbol{1}^{T}\boldsymbol{\Sigma}\boldsymbol{1}, \boldsymbol{1}^{T}\boldsymbol{\gamma}, \lambda+1, a, b). \end{align*}$

From Corollary 3, $\text{TV}$ of $\text{GH}$ distribution is given by

$\begin{align*} \nonumber \text{TV}_p(X) = &\frac{\sigma^2}{1-p}\left[(x_p-\mu)\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}f_{ \text{GH}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \lambda+1, a, b)\right.\\ \nonumber &+ \left. \gamma \frac{a}{b}\frac{K_{\lambda+2}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})} f_{ \text{GH}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \lambda+2, a, b)\right.\\ \nonumber&+ \left. \sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\overline{F}_{ \text{GH}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \lambda+1, a, b)\right]\\ \nonumber &+\frac{\gamma^2}{1-p}\frac{a}{b}\frac{K_{\lambda+2}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})} \overline{F}_{ \text{GH}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \lambda+2, a, b)\\ \nonumber&-\left(\frac{1}{1-p}\sqrt{\frac{a}{b}}\frac{K_{\lambda+1}(\sqrt{ab})}{K_{\lambda}(\sqrt{ab})}\right)^2\left[\gamma\overline{F}_{ \text{GH}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \lambda+1, a, b) \right.\\ &+ \left.\sigma^2 f_{ \text{GH}, 1}(x_{p};\mu, \sigma^{2}, \gamma, \lambda+1, a, b)\right]^2. \end{align*}$

7. Numerical illustration

In this section we discuss the $\text{TV}$ of five stocks (Amazon, Goldman Sachs, IBM, Google, and Apple) and aggregate portfolio covering a time frame from the 1st of January 2015 to the 1st of January 2017. Ignatieva and Landsman ^[12] fitted a $\text{GH}$ model to five stocks and aggregate portfolio, and obtained the following parameter set based on the maximum likelihood technique.

$\begin{align} \lambda = &-1.18336, \; \; \; a = 1.272016, \; \; \; \psi = 0.348483, \\ \boldsymbol{\mu} = & \left( \begin{array}{c} -0.09977 \\ -0.04555 \\ -0.09355\\ -0.03669\\ -0.10367 \end{array} \right), \; \; \; \; \; \; \boldsymbol{\gamma} = \left(\begin{array}{c} -0.08626\\ -0.00803\\ 0.07928\\ -0.05230\\ 0.08534 \end{array} \right), \end{align}$

$\begin{align} \boldsymbol{\Sigma} = & \left( \begin{array}{ccccc} 3.387&1.407&1.103&1.828&1.354 \\ 1.407&3.014&1.288&1.209&1.434 \\ 1.103&1.288&1.870&1.061&1.155\\ 1.828&1.209&1.061&2.171&1.220\\ 1.354&1.434&1.155&1.220&2.891 \end{array} \right). \end{align}$

For the risk analysis, we denote five stocks as $X_{1}, \cdots, X_{5}$ . We also consider aggregate portfolio $S$ where each stock has equal weight for simplicity, so that the aggregate portfolio is defined as $S = X_{1}+\cdots+X_{5}$ . shows the densities for five stocks $X_{i}, i = 1, \cdots, 5$ and aggregate portfolio $S$ . The $\text{pdf}$ of $S$ has the largest variance, and Amazon has the largest dispersion among five stocks. IBM has the smallest dispersion. presents the $\text{TV}$ for five stocks $X_{i}, i = 1, \cdots, 5$ and aggregate portfolio $S$ . All the risk measures increase over the quantile with the $\text{TV}$ . Also shows the differences in the $\text{TV}$ measure along with five stocks and aggregate portfolio. For the same quantile, the $\text{TV}$ of Apple is the largest one and the $\text{TV}$ of IBM is the smallest one among the five stocks.

Figure 1. Densities for

$X_i$ ,

$i = 1, \cdots, 5$ and

$S$ for

$\text{GH}$ .

DownLoad: Full-Size Img PowerPoint

Figure 2.

$\text{TV}$ for

$X_i$ ,

$i = 1, \cdots, 5$ and

$S$ for

$\text{GH}$ .

DownLoad: Full-Size Img PowerPoint

8. Conclusions

In this paper we generalize the tail risk measure and portfolio risk decomposition with $\text{TCE}$ formula derived by ^[15] for the class of multivariate normal mean-variance mixture distributions to the larger class of multivariate elliptical location-scale mixtures distributions. A prominent member in the normal mean-variance mixture class is the generalized hyperbolic ( $\text{GH}$ ) distribution, which itself can construct a L $\acute{e}$ vy process. The $\text{GH}$ is a special case of normal mean-variance mixture random variable with $\boldsymbol{X}\sim \text{N}_n{(\boldsymbol{0}, \boldsymbol{I_n})}$ and the distribution of $\Theta$ given by a generalized inverse Gaussian (GIG) distribution with three parameters (see ^[12,15] for details). Prominent member in the elliptical location-scale mixtures class is the generalized hyper-elliptical ( $\text{GHE}$ ) distribution. The $\text{GHE}$ distribution provides excellent fit to univariate and multivariate data, allowing to capture a long right tail in the distribution of losses even more effectively than the $\text{GH}$ distribution considered in ^[12]. And $\text{GHE}$ is a special case of elliptical location-scale mixtures random variable with $\boldsymbol{X}\sim \text{N}_n{(\boldsymbol{0}, \boldsymbol{I_n})}$ and the distribution of $\Theta$ given by a generalized inverse Gaussian (GIG) distribution with three parameters. Although the univariate $\text{TCE}$ and portfolio risk decomposition with $\text{TCE}$ formula for the $\text{GHE}$ class was available in ^[13], it can be derived more efficiently and seen as a special case of $\text{TCE}$ for the unified location-scale mixtures of elliptical distributions and risk allocation formula in Theorems 1 and 2, respectively. And the univariate $\text{TV}$ formula for the $\text{GHE}$ class can be derived efficiently and seen as a special case of $\text{TV}$ for the unified location-scale mixtures of elliptical distributions in Theorem 3.

Acknowledgments

The research was supported by the National Natural Science Foundation of China (No. 12071251).

Conflict of interest

The authors declare no conflict of interest.

References

[1]	L. A. Torre, F. Bray, R. L. Siegel, J. Ferlay, J. Lortet-Tieulent, A. Jemal, Global cancer statistics, 2012, CA Cancer J. Clin., 65 (2015), 87–108. https://doi.org/10.3322/caac.21262 doi: 10.3322/caac.21262
[2]	M. F. Akay, Support vector machines combined with feature selection for breast cancer diagnosis, Expert Syst. Appl., 36 (2009), 3240–3247. https://doi.org/10.1016/j.eswa.2008.01.009 doi: 10.1016/j.eswa.2008.01.009
[3]	R. L. Siegel, K. D. Miller, A. Jemal, Cancer statistics, 2018, CA Cancer J. Clin., 68 (2018), 7–30. https://doi.org/10.3322/caac.21442 doi: 10.3322/caac.21442
[4]	L. Peng, W. Chen, W. Zhou, F. Li, J. Yang, J. Zhang, An immune-inspired semi-supervised algorithm for breast cancer diagnosis, Comput. Methods Programs Biomed., 134 (2016), 259–265. https://doi.org/10.1016/j.cmpb.2016.07.020 doi: 10.1016/j.cmpb.2016.07.020
[5]	H. L. Chen, B. Yang, J. Liu, D. Y. Liu, A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis, Expert Syst. Appl., 38 (2011), 9014–9022. https://doi.org/10.1016/j.eswa.2011.01.120 doi: 10.1016/j.eswa.2011.01.120
[6]	J. B. Li, Y. Peng, D. Liu, Quasiconformal kernel common locality discriminant analysis with application to breast cancer diagnosis, Inf. Sci., 223 (2013), 256–269. https://doi.org/10.1016/j.ins.2012.10.016 doi: 10.1016/j.ins.2012.10.016
[7]	B. Zheng, S. W. Yoon, S. S. Lam, Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms, Expert Syst. Appl., 4 (2014), 1476–1482. https://doi.org/10.1016/j.eswa.2013.08.044 doi: 10.1016/j.eswa.2013.08.044
[8]	F. Gorunescu, S. Belciug, Evolutionary strategy to develop learning-based decision systems. Application to breast cancer and liver fibrosis stadialization, J. Biomed. Inform., 49 (2014), 112–118. https://doi.org/10.1016/j.jbi.2014.02.001 doi: 10.1016/j.jbi.2014.02.001
[9]	M. Karabatak, A new classifier for breast cancer detection based on Naive Bayesian, Meas., 72 (2015), 32–36. https://doi.org/10.1016/j.measurement.2015.04.028 doi: 10.1016/j.measurement.2015.04.028
[10]	R. Sheikhpour, M. A. Sarram, R. Sheikhpour, Particle swarm optimization for bandwidth determination and feature selection of kernel density estimation based classifiers in diagnosis of breast cancer, Appl. Soft Comput., 40 (2016), 113–131. https://doi.org/10.1016/j.asoc.2015.10.005 doi: 10.1016/j.asoc.2015.10.005
[11]	M. F. Ijaz, M. Attique, Y. Son, Data-driven cervical cancer prediction model with outlier detection and over-sampling methods, Sensors, 20 (2020), 2809. https://doi.org/10.3390/s20102809 doi: 10.3390/s20102809
[12]	M. Mandal, P. K. Singh, M. F. Ijaz, J. Shafi, R. Sarkar, A Tri-Stage Wrapper-Filter Feature Selection Framework for Disease Classification, Sensors, 21 (2021), 5571. https://doi.org/10.3390/s21165571 doi: 10.3390/s21165571
[13]	H. Patel, G. S. Thakur, Classification of imbalanced data using a modified fuzzy-neighbor weighted approach, Int. J. Intell. Eng. Syst., 10 (2017), 56–64. https://doi.org/10.22266/ijies2017.0228.07 doi: 10.22266/ijies2017.0228.07
[14]	W. C. Lin, C. F. Tsai, Y. H. Hu, J. S. Jhang, Clustering-based undersampling in class-imbalanced data, Inf. Sci., 409 (2017), 17–26. https://doi.org/10.1016/j.ins.2017.05.008 doi: 10.1016/j.ins.2017.05.008
[15]	P. D. Turney, Cost-sensitive classification: Empirical evaluation of a hybrid genetic decision tree induction algorithm, J. Artif. Intell. Res., 2 (1994), 369–409. https://doi.org/10.1613/jair.120 doi: 10.1613/jair.120
[16]	H. E. Kiziloz, Classifier ensemble methods in feature selection, Neurocomputing, 419 (2021), 97–107. https://doi.org/10.1016/j.neucom.2020.07.113 doi: 10.1016/j.neucom.2020.07.113
[17]	M. Galar, A. Fernández, E. Barrenechea, H. Bustince, F. Herrera, Ordering-based pruning for improving the performance of ensembles of classifiers in the framework of imbalanced datasets, Inf. Sci., 354 (2016), 178–196. https://doi.org/10.1016/j.ins.2016.02.056 doi: 10.1016/j.ins.2016.02.056
[18]	J. Zhang, L. Chen, J. Tian, F. Abid, W. Yang, X. Tang, Breast cancer diagnosis using cluster-based undersampling and boosted C5. 0 algorithm, Int. J. Control Autom. Syst., 19 (2021), 1998–2008. https://doi.org/10.1007/s12555-019-1061-x doi: 10.1007/s12555-019-1061-x
[19]	Z. Zheng, X. Wu, R. Srihari, Feature selection for text categorization on imbalanced data, ACM Sigkdd Explor. Newsl., 6 (2004), 80–89. https://doi.org/10.1145/1007730.1007741 doi: 10.1145/1007730.1007741
[20]	S. Punitha, F. Al-Turjman, T. Stephan, An automated breast cancer diagnosis using feature selection and parameter optimization in ANN, Comput. Electr. Eng., 90 (2021), 106958. https://doi.org/10.1016/j.compeleceng.2020.106958 doi: 10.1016/j.compeleceng.2020.106958
[21]	P. N. Srinivasu, J. G. SivaSai, M. F. Ijaz, A. K. Bhoi, W. Kim, J. J. Kang, Classification of skin disease using deep learning neural networks with MobileNet V2 and LSTM, Sensors, 21 (2021), 2852. https://doi.org/10.3390/s21082852 doi: 10.3390/s21082852
[22]	H. Naeem, A. A. Bin-Salem, A CNN-LSTM network with multi-level feature extraction-based approach for automated detection of coronavirus from CT scan and X-ray images, Appl. Soft Comput., 113 (2021), 107918. https://doi.org/10.1016/j.asoc.2021.107918 doi: 10.1016/j.asoc.2021.107918
[23]	P. Huang, Q. Ye, F. Zhang, G. Yang, W. Zhu, Z. Yang, Double L2, p-norm based PCA for feature extraction, Inf. Sci., 573 (2021), 345–359. https://doi.org/10.1016/j.ins.2021.05.079 doi: 10.1016/j.ins.2021.05.079
[24]	H. D. Cheng, X. J. Shi, R. Min, L. M. Hu, X. P. Cai, H. N. Du, Approaches for automated detection and classification of masses in mammograms, Pattern Recognit., 4 (2006), 646–668. https://doi.org/10.1016/j.patcog.2005.07.006 doi: 10.1016/j.patcog.2005.07.006
[25]	T. Raeder, G. Forman, N. V. Chawla, Learning from imbalanced data: Evaluation matters, in Data mining: Foundations and intelligent paradigms, Springer, (2012), 315–331. https://doi.org/10.1007/978-3-641-23166-7_12 doi: 10.1007/978-3-641-23166-7_12
[26]	S. Piri, D. Delen, T. Liu, A synthetic informative minority over-sampling (SIMO) algorithm leveraging support vector machine to enhance learning from imbalanced datasets, Decis. Support Syst., 106 (2018), 15–29. https://doi.org/10.1016/j.dss.2017.11.006 doi: 10.1016/j.dss.2017.11.006
[27]	C. Seiffert, T. M. Khoshgoftaar, J. Van. Hulse, A. Napolitano, RUSBoost: A hybrid approach to alleviating class imbalance, IEEE Trans. Syst. Man Cybern. Part A: Syst. Hum., 40 (2009), 185–197. https://doi.org/10.1109/tsmca.2009.2029559 doi: 10.1109/tsmca.2009.2029559
[28]	N. Liu, E. S. Qi, M. Xu, B. Gao, G. Q. Liu, A novel intelligent classification model for breast cancer diagnosis, Inf. Process. Manage., 56 (2019), 609–623. https://doi.org/10.1016/j.ipm.2018.10.014 doi: 10.1016/j.ipm.2018.10.014
[29]	S. Wang, Y. Wang, D. Wang, Y. Yin, Y. Wang, Y. Jin, An improved random forest-based rule extraction method for breast cancer diagnosis, Appl. Soft Comput., 86 (2020), 105941. https://doi.org/10.1016/j.asoc.2019.105941 doi: 10.1016/j.asoc.2019.105941
[30]	H. Wang, B. Zheng, S. W. Yoon, H. S. Ko, A support vector machine-based ensemble algorithm for breast cancer diagnosis, Eur. J. Oper. Res., 267 (Year), 687–699. https://doi.org/10.1016/j.ejor.2017.12.001 doi: 10.1016/j.ejor.2017.12.001
[31]	L. Breiman, Bagging predictors, Mach. Learn., 24 (1996), 123–140. https://doi.org/10.1007/BF00058655 doi: 10.1007/BF00058655
[32]	A. Taherkhani, G. Cosma, T. M. McGinnity, AdaBoost-CNN: An adaptive boosting algorithm for convolutional neural networks to classify multi-class imbalanced datasets using transfer learning, Neurocomputing, 404 (2020), 351–366. https://doi.org/10.1016/j.neucom.2020.03.064 doi: 10.1016/j.neucom.2020.03.064

This article has been cited by:

Mengxin He, Zhong Li, Dynamic behaviors of a Leslie-Gower predator-prey model with Smith growth and constant-yield harvesting, 2024, 32, 2688-1594, 6424, 10.3934/era.2024299

Reader Comments

Your name:*

Email:*
© 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Mathematical Biosciences and Engineering

3.9

Metrics

Article views(3448) PDF downloads(200) Cited by(13)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

Mathematical Biosciences and Engineering

Breast cancer diagnosis using feature extraction and boosted C5.0 decision tree algorithm with penalty factor

Related Papers:

Abstract

1. Introduction and motivation

2. Mixture of elliptical distributions

3. Tail conditional expectation for univariate case

4. Portfolio risk decomposition with ${TCE}$ for multivariate ${LSME}$

5. Tail variance for univariate case

6. Examples

7. Numerical illustration

8. Conclusions

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Catalog

Abstract

1. Introduction and motivation

2. Mixture of elliptical distributions

3. Tail conditional expectation for univariate case

4. Portfolio risk decomposition with ${TCE}$ for multivariate ${LSME}$

5. Tail variance for univariate case

6. Examples

7. Numerical illustration

8. Conclusions

Acknowledgments

Conflict of interest

References

Mathematical Biosciences and Engineering

Breast cancer diagnosis using feature extraction and boosted C5.0 decision tree algorithm with penalty factor

Related Papers:

Abstract

1. Introduction and motivation

2. Mixture of elliptical distributions

3. Tail conditional expectation for univariate case

4. Portfolio risk decomposition with TCE {TCE} for multivariate LSME {LSME}

5. Tail variance for univariate case

6. Examples

7. Numerical illustration

8. Conclusions

Acknowledgments

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog

Abstract

1. Introduction and motivation

2. Mixture of elliptical distributions

3. Tail conditional expectation for univariate case

4. Portfolio risk decomposition with {TCE} {TCE} for multivariate {LSME} {LSME}

5. Tail variance for univariate case

6. Examples

7. Numerical illustration

8. Conclusions

Acknowledgments

Conflict of interest

References

4. Portfolio risk decomposition with ${TCE}$ for multivariate ${LSME}$

4. Portfolio risk decomposition with ${TCE}$ for multivariate ${LSME}$