Enhancing skeleton-based human motion recognition with Lie algebra and memristor-augmented LSTM and CNN

Zhencheng Fan; Zheng Yan; Yuting Cao; Yin Yang; Shiping Wen; Zhencheng Fan; Zheng Yan; Yuting Cao; Yin Yang; Shiping Wen

doi:10.3934/math.2024871

AIMS Mathematics

2024, Volume 9, Issue 7: 17901-17916. doi: 10.3934/math.2024871

Previous Article Next Article

Research article Special Issues

Enhancing skeleton-based human motion recognition with Lie algebra and memristor-augmented LSTM and CNN

1.
Australian AI Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, NSW 2007, Australia
2.
College of Science and Engineering, Hamad Bin Khalifa University, 5855, Doha, Qatar

Received: 13 December 2023 Revised: 24 March 2024 Accepted: 28 April 2024 Published: 24 May 2024
MSC : 68T07, 68T10

Lately, as a subset of human-centric studies, vision-oriented human action recognition has emerged as a pivotal research area, given its broad applicability in fields like healthcare, video surveillance, autonomous driving, sports, and education. This brief applies Lie algebra and standard bone length data to represent human skeleton data. A multi-layer long short-term memory (LSTM) recurrent neural network and convolutional neural network (CNN) are applied for human motion recognition. Finally, the trained network weights are converted into the crossbar-based memristor circuit, which can accelerate the network inference, reduce energy consumption, and obtain an excellent computing performance.

Keywords:

Citation: Zhencheng Fan, Zheng Yan, Yuting Cao, Yin Yang, Shiping Wen. Enhancing skeleton-based human motion recognition with Lie algebra and memristor-augmented LSTM and CNN[J]. AIMS Mathematics, 2024, 9(7): 17901-17916. doi: 10.3934/math.2024871

Related Papers:

[1]	Vladica S. Stojanović, Hassan S. Bakouch, Radica Bojičić, Gadir Alomair, Shuhrah A. Alghamdi . Poisson-Lindley minification INAR process with application to financial data. AIMS Mathematics, 2024, 9(8): 22627-22654. doi: 10.3934/math.20241102
[2]	Bader S. Almohaimeed . Periodic stationarity conditions for mixture periodic INGARCH models. AIMS Mathematics, 2022, 7(6): 9809-9824. doi: 10.3934/math.2022546
[3]	Muhammad Farman, Ali Akgül, Kottakkaran Sooppy Nisar, Dilshad Ahmad, Aqeel Ahmad, Sarfaraz Kamangar, C Ahamed Saleel . Epidemiological analysis of fractional order COVID-19 model with Mittag-Leffler kernel. AIMS Mathematics, 2022, 7(1): 756-783. doi: 10.3934/math.2022046
[4]	Faik Babadağ, Ali Atasoy . On hyper-dual vectors and angles with Pell, Pell-Lucas numbers. AIMS Mathematics, 2024, 9(11): 30655-30666. doi: 10.3934/math.20241480
[5]	I. H. K. Premarathna, H. M. Srivastava, Z. A. M. S. Juman, Ali AlArjani, Md Sharif Uddin, Shib Sankar Sana . Mathematical modeling approach to predict COVID-19 infected people in Sri Lanka. AIMS Mathematics, 2022, 7(3): 4672-4699. doi: 10.3934/math.2022260
[6]	Antonio Di Crescenzo, Alessandra Meoli . On a fractional alternating Poisson process. AIMS Mathematics, 2016, 1(3): 212-224. doi: 10.3934/Math.2016.3.212
[7]	CW Chukwu, S. Y. Tchoumi, Z. Chazuka, M. L. Juga, G. Obaido . Assessing the impact of human behavior towards preventative measures on COVID-19 dynamics for Gauteng, South Africa: a simulation and forecasting approach. AIMS Mathematics, 2024, 9(5): 10511-10535. doi: 10.3934/math.2024514
[8]	Shahbaz Ali, Muhammad Khalid Mahmmod, Raúl M. Falcón . A paradigmatic approach to investigate restricted hyper totient graphs. AIMS Mathematics, 2021, 6(4): 3761-3771. doi: 10.3934/math.2021223
[9]	Lihong Guan, Xiaohong Wang . A discrete-time dual risk model with dependence based on a Poisson INAR(1) process. AIMS Mathematics, 2022, 7(12): 20823-20837. doi: 10.3934/math.20221141
[10]	C. W. Chukwu, Fatmawati . Modelling fractional-order dynamics of COVID-19 with environmental transmission and vaccination: A case study of Indonesia. AIMS Mathematics, 2022, 7(3): 4416-4438. doi: 10.3934/math.2022246

Abstract

1. Introduction

Various modelling approaches have been proposed for the time series of counts and recent reviews on this topic can be found in many sources (^{[5,15,16,21,22,24,32,34]}). The time series of counts analysis in the literature can generally be classified into parameter-driven or observation-driven models. Zeger ^[38] initiated a class of parameter-driven models for time series of counts, which introduces autocorrelation as well as over-dispersion into the model through a latent process. Despite having wide applications in various fields, there is difficulty with efficient estimation for such parameter-driven models. Recently, Koh et al. ^[24] showed that the Monte Carlo Expectation Maximization (MCEM) algorithm is one of the alternative for the estimation of the parameters, and particle filter and smoother are useful approaches in inferencing the unobserved latent variables of the model. Jung et al. ^[21] and Jung and Tremayne ^[22] have reviewed both approaches of the parameter-driven and observation-driven models for time series of counts as well as the estimation and diagnostic tests performed for these time series models.

Observation-driven thinning-based models are commonly used to model over-dispersed or under-dispersed count data, see for example Weiß ^[33], Bourguignon and Weiß ^[3], Yang ^[37] and Kang ^[23]. Weiß ^[33] showed that the first-order nonnegative integer-value autoregressive (INAR(1)) model based on binomial thinning operator with Good- and power-law weighted Poisson-distributed innovations are particularly well-suited for modelling under-dispersed counts. Bourguignon and Weiß ^[3] proposed a new INAR(1) model for stationary count data processes with Bernoulli-geometric (BerG) marginal distributions, that can model time series of counts with over-, equi- and under-dispersion, from a new generalized thinning operator based on the convolution of binomial and negative binomial random variables. Recently, Yang ^[37] introduced a novel thinning operator based on the generalized Poisson distribution, called GP thinning operator, and Kang ^[23] defined a new thinning operator, called GSC thinning operator, based on a new discrete distribution proposed by Gómez-Déniz et al. ^[17], to construct a new INAR(1) model to capture the dispersion features of count time series count.

Another popular observation-driven approach for modelling time series of count are the INGARCH models. Heinen ^[19] proposed the autoregressive conditional Poisson (ACP) model, Ferland et al. ^[14] called it as integer-valued generalised autoregressive conditional heteroscedasticity (INGARCH) model, that has the advantage in the ease of application of the likelihood to evaluate the model. More specifically, the time series of counts $\left\{{Y}_{t}\right\}$ for $t\ge 1$ follows a conditional Poisson distribution with an autoregressive mean ${\mu }_{t}$ , which is defined as

${Y}_{t}\left|{\mathcal{F}}_{t-1}\right.\sim Poisson\left({\mu }_{t}\right)$

$E\left[{Y}_{t}\left|{\mathcal{F}}_{t-1}\right.\right] = {\mu }_{t} = {\alpha }_{0}+\sum _{i = 1}^{p}{\alpha }_{i}{Y}_{t-i}+\sum _{j = 1}^{q}{\beta }_{j}{\mu }_{t-j} ,$

(1.1)

where ${\alpha }_{0} > 0$ , ${\alpha }_{i}\ge 0$ , ${\beta }_{j}\ge 0$ , $i = \mathrm{1, 2}, \dots, p$ , $j = \mathrm{1, 2}, \dots, q$ , $p\ge 1, q\ge 0$ and ${\mathcal{F}}_{t-1}$ denotes the information available on the series up to and including time $t-1$ . It is worth noting that the conditional mean (1.1), dependent on past observed counts and past conditional means, is similar to the conditional variance in the GARCH model by Bollerslev ^[2] as well as the conditional intensity of the autoregressive conditional duration model by Engle and Russell ^[12] for continuous-valued discrete-time data. However, the INGARCH model with Poisson distribution is suitable only for positive serial correlation and over-dispersion in the count data.

In this paper, we will focus on developing a class of observation-driven INGARCH-type models for modelling time series of counts that may exhibit over-, equi- and under-dispersion. The Poisson distribution, which has been commonly used to model time series of counts, is restrictive to the model variance equal to the mean; that is, for modelling equi-dispersed data. Therefore, Heinen ^[19] proposed the generalised double autoregressive conditional Poisson model which can model over-, equi- and under-dispersion of count time series. The double Poisson (DP) distribution proposed by Efron ^[11], which has two parameters, is used in this model. Bourguignon et al. ^[4] also used the double Poisson distribution to extend the INAR(1) for modelling count time series with over-, equi- and under-dispersion. However, Winkelmann ^[36] found that the results of the DP with its normalizing constant approximated by Efron's original method are not exact, making it a hurdle for applications. Moreover, Zhu ^[41] also argued that the DP distribution, having intractable normalizing constant and moments, is difficult to utilize for modelling and many of its properties remained unknown. Hence, some of the theoretical aspects of the resulting DP-INGARCH(1, 1) models may be difficult to establish. On the other hand, Zou et al. ^[42] noted that while DP provides a good fit when the mean is high for all dispersion types, the fit is highly unreliable when the mean is small. With these shortcomings, the DP distribution may be unattractive for the analysis of time series despite being able to model over-, equi- and under-dispersion of the count data.

Zhu ^[39] presented the negative binomial INGARCH (NB-INGARCH) model that works well for an over-dispersed poliomyelitis monthly data, and claimed that the proposed model is better than the Poisson- and DP-INGARCH models. Although the mean-variance relationship of the NB enables over-dispersion to be captured, there is difficulty in handling data characterized by under-dispersion (^[26]). Subsequently, Zhu ^[41] proposed a generalized Poisson INGARCH (GP-INGARCH) model as an alternative model to account for both over- and under-dispersion in time series of counts data. The generalized Poisson (GP) distribution has been studied extensively by many sources (^[6,7,13]). Unfortunately, Zhu ^[40] states that the GP model is not a true probability model under certain conditions, leading to its inability to model some levels of under-dispersion.

Zhu ^[40] in turn addressed the weakness of this model by proposing a COM-Poisson INGARCH model and indicating that this model is a powerful competitor to the GP model. The COM-Poisson (COMP) distribution has been proposed by Conway and Maxwell ^[8] as a model for queuing systems with state-dependent service times. This distribution has been used widely after Shmueli et al. ^[29] further examined its statistical and probabilistic properties. The COMP distribution has two parameters with λ as the centering parameter and v as the dispersion parameter. Zhu ^[40] noted that a COMP model based on the COMP formulation for λ would be difficult to interpret. To overcome this problem, a re-parameterization of the COMP distribution, $\mu = {\lambda }^{1/v}$ , has been proposed by Guikema and Goffelt ^[18] to provide a clear centering parameter. Recently, Qian and Zhu ^[27] used the generalized COMP distribution, which has one more parameter than COMP distribution to handle heavy-tailed count time series.

Suitable to model both over- and under-dispersed data, the hyper-Poisson (HP) distribution is a popular distributions besides DP, GP and COMP distributions and belongs to the two-parameter family of discrete distributions. The HP distribution, introduced by Bardwell and Crow ^[1], is generated by the confluent function and admits over-dispersion as well as under-dispersion. Although the HP distribution was applied in generalized linear models and regression models for over-dispersed and under-dispersed count data (see Sáez-Castillo and Conde-Sánchez ^[28]), it is difficult to apply in INGARCH model. The mean and variance are in confluent hypergeometric form and hence, the conditional mean and conditional variance for INGARCH model are hard to derive. Kumar and Nair ^[25] then introduced an alternative hyper-Poisson (AHP) distribution, which has simpler forms of the mean and variance.

In this paper, we introduce the AHP distribution as an INGARCH model for modelling over-dispersed and under-dispersed count time series data, and then compare it to existing INGARCH models. The AHP distribution has simpler forms for the mean and variance compared to the HP distribution, making it potentially more viable and useful choice as an INGARCH model. We also investigate the properties of the AHP distribution and derive a lower bound for one of the parameters to ensure a valid probability mass function (pmf) for the case of under-dispersion. Similar to Weiß ^[31], we provide a set of Yule-Walker type equations from which the autocorrelation function of general AHP-INGARCH( $p, q$ ) models can be obtained. In particular, we derive the equations of variance and autocorrelation function for AHP-INGARCH(1, 1) model. It is worth to noting that the unconditional variance of AHP-INGARCH(1, 1) model is more general than the unconditional variance of INGARCH model of Weiß ^[31]. The INGARCH model of Weiß ^[31] is a special case of AHP-INGARCH model when $\gamma = 1$ . In the applications to three real-life data sets, we have included five existing INGARCH models, namely the Poisson-, NB-, COMP-, GP- and DP-INGARCH models in direct comparison against the proposed AHP-INGARCH model, highlighting the usefulness of AHP-INGARCH as a competitive model in practice.

The contents of this paper are organized as follows. Section 2 presents the AHP distribution along with its basic properties, while Section 3 outlines the proposed INGARCH model with the AHP distribution. The maximum likelihood estimation for the proposed model is briefly discussed in Section 4, followed by a simulation study to assess the performance of this estimation method in Section 5. Section 6 demonstrates the application of the proposed model to three real-life data sets, which exhibit different types of dispersions, in comparison to some existing models for time series of counts. Finally, Section 7 provides some concluding remarks.

2. Some properties of the alternative hyper-Poisson distribution

In this section, we give a brief review of the hyper-Poisson (HP) distribution and its alternative form. The two-parameter HP distribution is first introduced by Bardwell and Crow ^[1] through the following probability generating function (pgf):

$G\left(u\right) = E\left[{u}^{Z}\right] = \frac{\phi \left(1;\gamma ;\theta u\right)}{\phi \left(1;\gamma ;\theta \right)}, \theta > 0, \gamma > 0,$

where $\phi \left(\mathrm{a}; b;c\right) = \sum _{k = 0}^{\mathrm{\infty }}\frac{{\left(\mathrm{a}\right)}_{k}{c}^{k}}{{\left(b\right)}_{k}k!}$ is the confluent hypergeometric series, or known as the Kummer M function, in which ${\left(a\right)}_{k} = \frac{\mathrm{\Gamma }\left(a+k\right)}{\mathrm{\Gamma }\left(a\right)}$ is the Pochhammer symbol and $\mathrm{\Gamma }\left(\bullet \right)$ is a gamma function. Sáez-Castillo and Conde-Sánchez ^[28] utilize the HP distribution to model over-dispersed and under-dispersed count data and showed that the range of values of the dispersion index of HP distribution is wide in both situations of over- and under-dispersion. However, the mean of HP distribution can only be expressed in term of the confluent hypergeometric series.

Kumar and Nair ^[25] then introduced an alternative form of HP distribution, called AHP. The pmf and pgf of AHP distribution ( $AHP\left(\theta, \gamma \right))$ can be written, respectively as

$P\left(Z = z\right) = \frac{{\theta }^{z}}{{\left(\gamma \right)}_{z}}\phi \left(1+z;\gamma +z;-\theta \right), \theta > 0, \gamma > 0, z = \mathrm{0, 1}, 2, \dots ,$

and

$G\left(u\right) = \phi \left(1;\gamma ;\theta \left(u-1\right)\right) = \sum \limits_{k = 0}^{\mathrm{\infty }}\frac{{\left(1\right)}_{k}}{{\left(\gamma \right)}_{k}}\frac{{\left[\theta \left(u-1\right)\right]}^{k}}{k!},$

where $Z$ is the AHP distributed random variable. Note that when $\gamma = 1$ , we have $G\left(u\right) = \sum _{k = 0}^{\mathrm{\infty }}\frac{{\left[\theta \left(u-1\right)\right]}^{k}}{k!} = {e}^{\theta \left(u-1\right)}$ , the pgf of a Poisson distribution with parameter $\theta$ .

The mean and variance of a AHP distributed random variable $Z$ are, respectively, ${\mu }_{Z} = E\left[Z\right] = \frac{\theta }{\gamma }$ and ${\sigma }_{Z}^{2} = \frac{\theta }{\gamma }\left[1+\frac{\theta }{\gamma }\left(\frac{\gamma -1}{\gamma +1}\right)\right] = {\mu }_{Z}-{\mu }_{Z}^{2}+\frac{2\gamma {\mu }_{Z}^{2}}{1+\gamma }$ . However, we found that in order to guarantee the positivity of the variance, the parameter $\gamma$ needs to be constrained by $\gamma > \frac{-1-\theta +\sqrt{{\theta }^{2}+6\theta +1}}{2}$ (or $\gamma > \frac{{\mu }_{Z}-1}{{\mu }_{Z}+1}$ ). shows the feasible range of $\gamma$ for a given ${\mu }_{Z}$ ; we note that large $\gamma$ is required for large ${\mu }_{Z}$ so that ${\sigma }_{Z}^{2} > 0$ .

Figure 1. Feasible range of

$\gamma$ for a given mean

${\mu }_{Z}$ (shaded region).

DownLoad: Full-Size Img PowerPoint

The parameter constraint for the positivity of the variance can also be rewritten as $\theta < {\theta }_{1}$ , for $0 < \gamma < 1$ , where ${\theta }_{1} = \frac{\gamma (\gamma +1)}{1-\gamma }$ . However, in order to guarantee the positivity of the pmf of AHP distribution, the parameter $\theta$ needs to be constrained by $\theta < {\theta }_{2}$ , where ${\theta }_{2}$ is the solution of $\phi \left(\gamma -1;\gamma; {\theta }_{2}\right) = 0$ (see Appendix A.1 for the poof). shows the boundary lines of ${\theta }_{1}$ and ${\theta }_{2}$ for a given $\gamma \in \left(\mathrm{0, 1}\right)$ . For an AHP distribution, the feasible region of $\theta$ for a given $\gamma \in \left(\mathrm{0, 1}\right)$ is the region below the solid line in Figure 2.

Figure 2. The boundary lines for the positivity of variance (dash line) and positivity of pmf (solid line) of an AHP distribution with

$0 < \gamma < 1$ .

DownLoad: Full-Size Img PowerPoint

The dispersion index of AHP distribution is given by

${I}_{Z} = \frac{{\sigma }_{Z}^{2}}{{\mu }_{Z}} = 1-{\mu }_{Z}\left(\frac{1-\gamma }{1+\gamma }\right).$

We note that the AHP distribution is under-dispersed $\left(0 < {I}_{Z} < 1\right)$ when $\frac{-1-\theta +\sqrt{{\theta }^{2}+6\theta +1}}{2} < \gamma < 1$ , equi-dispersed $\left({I}_{Z} = 1\right)$ when $\gamma = 1,$ and over-dispersed $\left({I}_{Z} > 1\right)$ when $\gamma > 1$ . Hence, the parameter $\gamma$ is the dispersion parameter for AHP distribution. Bourguignon and Weiß ^[3] studied the dispersion behaviour of a BerG distribution, that is, the distribution of the convolution of a Bernoulli and a geometric random variable. They found that the BerG distribution can be used in count time series modelling with over-, equi- and under-dispersion. As noted by Bourguignon and Weiß ^[3], there is no feasible region for under-dispersion when the mean is larger than two for a BerG distribution. However, the AHP distribution can be seen to cover a wider range of dispersion regions compared to the BerG distribution.

In addition, the third and fourth central moments of an AHP random variable $Z$ are obtained as follows:

${\mu }_{3} = E\left[{\left(Z-{\mu }_{Z}\right)}^{3}\right] = {\mu }_{Z}-3{\mu }_{Z}^{2}+\frac{6\gamma {\mu }_{Z}^{2}}{1+\gamma }+2{\mu }_{Z}^{3}-\frac{6\gamma {\mu }_{Z}^{3}}{1+\gamma }+\frac{6{\gamma }^{2}{\mu }_{Z}^{3}}{\left(1+\gamma \right)\left(2+\gamma \right)} ,$

${\mu }_{4} = E\left[{\left(Z-{\mu }_{Z}\right)}^{4}\right] = {\mu }_{Z}-4{\mu }_{Z}^{2}+\frac{14\gamma {\mu }_{Z}^{2}}{1+\gamma }+6{\mu }_{Z}^{3}-\frac{24\gamma {\mu }_{Z}^{3}}{1+\gamma }+\frac{36{\gamma }^{2}{\mu }_{Z}^{3}}{\left(1+\gamma \right)\left(2+\gamma \right)}-3{\mu }_{Z}^{4} \\ +\frac{12\gamma {\mu }_{Z}^{4}}{1+\gamma }-\frac{24{\gamma }^{2}{\mu }_{Z}^{4}}{\left(1+\gamma \right)\left(2+\gamma \right)}+\frac{24{\gamma }^{3}{\mu }_{Z}^{4}}{\left(1+\gamma \right)\left(2+\gamma \right)\left(3+\gamma \right)}.$

3. The AHP-INGARCH(p, q) model

Let $\left\{{\left.{Y}_{t}\right\}}_{t\ge 1}\right.$ denotes a univariate time series of counts and ${\mathcal{F}}_{t-1}$ be the $\sigma$ -field generated by $\left\{\left.{Y}_{t-1}, {Y}_{t-2}, \dots \right\}\right.$ . We assume that process $\left\{{Y}_{t}\right\}$ is conditionally independent given ${\mathcal{F}}_{t-1}$ and the conditional distribution of ${Y}_{t}$ given ${\mathcal{F}}_{t-1}$ is specified by an AHP distribution, that is

${Y}_{t}|{\mathcal{F}}_{t-1}\sim AHP\left({\theta }_{t}, \gamma \right) ,$

$E\left[{Y}_{t}|{\mathcal{F}}_{t-1}\right] = {\frac{{\theta }_{t}}{\gamma } = \mu }_{t} = {\alpha }_{0}+\sum _{i = 1}^{p}{\alpha }_{i}{Y}_{t-i}+\sum _{j = 1}^{q}{\beta }_{j}{\mu }_{t-j} ,$

(3.1)

where ${\alpha }_{0} > 0, {\alpha }_{i}\ge 0, {\beta }_{j}\ge 0,$ $i = 1, \dots, p$ , $j = 1, \dots, q$ , $p\ge 1$ , $q\ge 0$ , and $\gamma > 0$ . This model is denoted by AHP-INGARCH(p, q). Since the conditional distribution of ${Y}_{t}$ given ${\mathcal{F}}_{t-1}$ is specified by an AHP distribution, this model will be able to account for equi- ( $\gamma = 1$ ), over- ( $\gamma > 1$ ) and under-dispersion ( $\frac{-1-{\theta }_{t}+\sqrt{{\theta }_{t}^{2}+6{\theta }_{t}+1}}{2} < \gamma < 1$ ). When $\gamma = 1$ , the model (3.1) is equal to the Poisson-INGARCH model introduced by Heinen ^[19] and Ferland et al. ^[14] Some of the important properties of the AHP-INGARCH(p, q) time series model are given next.

Theorem 3.1. Let $\left\{{Y}_{t}\right\}$ be a weakly stationary process with range $\left\{\mathrm{0, 1}, \dots \right\}$ following the AHP-INGARCH(p, q) model in (3.1). If $\sum _{i = 1}^{p}{\alpha }_{i}+\sum _{j = 1}^{q}{\beta }_{j} < 1$ , then

(ⅰ) the unconditional expectation of ${Y}_{t}$ is given by

${\mu }_{Y} = E\left[{Y}_{t}\right] = \frac{{\alpha }_{0}}{\left(1-\sum _{i = 1}^{p}{\alpha }_{i}-\sum _{j = 1}^{q}{\beta }_{j}\right)} ;$

(ⅱ) the covariance Cov $\left[{Y}_{t}, {\mu }_{t-k}\right]$ fulfills

$Cov\left[{Y}_{t}, {\mu }_{t-k}\right] = \left\{\begin{array}{c}Cov\left[{\mu }_{t}, {\mu }_{t-k}\right], k\ge 0, \\ Cov\left[{Y}_{t}, {Y}_{t-k}\right], k < 0;\end{array}\right.$

(ⅲ) the autocovariances ${\gamma }_{Y}\left(k\right) = Cov\left[{Y}_{t}, {Y}_{t-k}\right]$ and ${\gamma }_{\mu }\left(k\right) = Cov\left[{\mu }_{t}, {\mu }_{t-k}\right]$ will satisfy the following equations:

${\gamma }_{Y}\left(k\right) = \sum _{i = 1}^{p}{\alpha }_{i}{\gamma }_{Y}\left(\left|k-i\right|\right)+\sum _{j = 1}^{min\left(k-1, q\right)}{\beta }_{j}{\gamma }_{Y}\left(k-j\right)+\sum _{j = k}^{q}{\beta }_{j}{\gamma }_{\mu }\left(j-k\right), k\ge 1,$

(3.2)

${\gamma }_{\mu }\left(k\right) = \sum _{i = 1}^{min\left(k, p\right)}{\alpha }_{i}{\gamma }_{\mu }\left(k-i\right)+\sum _{i = k+1}^{p}{\alpha }_{i}{\gamma }_{Y}\left(i-k\right)+\sum _{j = 1}^{q}{\beta }_{j}{\gamma }_{\mu }\left(\left|k-j\right|\right), k\ge 0.$

(3.3)

The proof of Theorem 3.1 is provided in Appendix A.2.

From (3.1), the conditional mean and conditional variance of ${Y}_{t}$ are given by, respectively,

$E\left({Y}_{t}|{\mathcal{F}}_{t-1}\right) = \frac{{\theta }_{t}}{\gamma } = {\mu }_{t},$

and

$V\left({Y}_{t}|{\mathcal{F}}_{t-1}\right) = \frac{{\theta }_{t}}{\gamma }\left[1+\frac{{\theta }_{t}}{\gamma }\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}\right] = {\mu }_{t}\left[1+{\mu }_{t}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}\right] .$

Then, the unconditional mean and unconditional variance of ${Y}_{t}$ are, respectively,

$E\left({Y}_{t}\right) = {\mu }_{Y} = \frac{{\alpha }_{0}}{1-\sum _{i = 1}^{p}{\alpha }_{i}-\sum _{j = 1}^{q}{\beta }_{j}} ,$

and

$V\left({Y}_{t}\right) = {\sigma }_{Y}^{2} = E\left[V\left({Y}_{t}|{\mathcal{F}}_{t-1}\right)\right]+V\left[E\left({Y}_{t}|{\mathcal{F}}_{t-1}\right)\right] = {\mu }_{Y}+{\mu }_{Y}^{2}\frac{\gamma -1}{\gamma +1}+\left(\frac{2\gamma }{\gamma +1}\right)V\left({\mu }_{t}\right) .$

3.1. AHP-INGARCH(1, 1) model

Consider the special case of an AHP-INGARCH(1, 1) model. With arguments similar to those in Example 1 of Weiß ^[31], we obtain the unconditional variance of the AHP-INGARCH(1, 1) model as

$V\left({Y}_{t}\right) = \left({\mu }_{Y}+{\mu }_{Y}^{2}\frac{\gamma -1}{\gamma +1}\right)\left(\frac{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}\right),$

with the variance of the conditional mean given by $V\left({\mu }_{t}\right) = \frac{{\alpha }_{1}^{2}{\mu }_{Y}\left(\gamma +1\right)+{\alpha }_{1}^{2}{\mu }_{Y}^{2}\left(\gamma -1\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}$ .

The autocovariance is

${\gamma }_{Y}\left(k\right) = {\alpha }_{1}{\gamma }_{Y}\left(k-1\right)+{\beta }_{1}{\gamma }_{Y}\left(k-1\right) = {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k-1}{\gamma }_{Y}\left(1\right)$

$= {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k-1}\left({\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\gamma -1}{\gamma +1}\right)\left[\frac{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-{\alpha }_{1}{\beta }_{1}\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}\right], k\ge 1 ,$

giving the autocorrelations as

${\rho }_{Y}\left(k\right) = {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k-1}\frac{{\alpha }_{1}\left({1-\beta }_{1}^{2}-{{\alpha }_{1}\beta }_{1}\right)}{\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)} = {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k-1}\frac{{\alpha }_{1}\left[1-{\beta }_{1}\left({\alpha }_{1}+{\beta }_{1}\right)\right]}{1-{\left({\alpha }_{1}+{\beta }_{1}\right)}^{2}+{\alpha }_{1}^{2}} , k\ge 1 .$

The proof of this special case is provided in Appendix A.3.

Corollary 3.1. Suppose that $\left\{{Y}_{t}\right\}$ following the AHP-INARCH $\left(p\right)$ model with $q = 0$ in model (3.1) is second-order stationary, then the autocovariance function ${\gamma }_{Y}\left(k\right)$ satisfies the equation

${\mathrm{\gamma }}_{Y}\left(k\right) = \sum\limits _{i = 1}^{p}{\alpha }_{i}{\gamma }_{Y}\left(\left|k-i\right|\right), k\ge 1.$

3.2. AHP-INARCH(1) model

Consider the AHP-INARCH(1) model. The unconditional mean and unconditional variance are given by respectively,

$E\left({Y}_{t}\right) = {\mu }_{Y} = \frac{{\alpha }_{0}}{1-{\alpha }_{1}},$

and

$V\left({Y}_{t}\right) = {\sigma }_{Y}^{2} = \frac{{\alpha }_{0}}{{\left(1-{\alpha }_{1}\right)}^{2}}\left[\frac{\left(\gamma +1\right)\left(1-{\alpha }_{1}\right)+{\alpha }_{0}\left(\gamma -1\right)}{\gamma +1-2{\gamma \alpha }_{1}^{2}}\right].$

From Corollary 3.1, one immediately obtains the autocovariance function of the AHP-INARCH(1) model as

${\gamma }_{Y}\left(k\right) = {\left({\alpha }_{1}\right)}^{k-1}{\gamma }_{Y}\left(1\right) = {\left({\alpha }_{1}\right)}^{k-1}\left(\frac{{\alpha }_{0}}{{\left(1-{\alpha }_{1}\right)}^{2}}\right)\left[\frac{{\alpha }_{1}\left(\gamma +1\right)\left(1-{\alpha }_{1}\right)+{\alpha }_{1}{\alpha }_{0}\left(\gamma -1\right)}{\gamma +1-2{\gamma \alpha }_{1}^{2}}\right] \\ { = \left({\alpha }_{1}\right)}^{k}\left(\frac{{\alpha }_{0}}{{\left(1-{\alpha }_{1}\right)}^{2}}\right)\left(\frac{\left(\gamma +1\right)\left(1-{\alpha }_{1}\right)+{\alpha }_{0}\left(\gamma -1\right)}{\gamma +1-2{\gamma \alpha }_{1}^{2}}\right) = {\left({\alpha }_{1}\right)}^{k}{\gamma }_{Y}\left(0\right).$

Hence, the autocorrelation function of the AHP-INARCH(1) model is

${\rho }_{Y}\left(k\right) = \frac{{\gamma }_{Y}\left(k\right)}{{\gamma }_{Y}\left(0\right)} = {\left({\alpha }_{1}\right)}^{k},$

like in the standard AR(1) case.

4. Maximum likelihood estimation

In this section, we will discuss the maximum likelihood estimation (MLE) for the AHP-INGARCH (p, q) model (3.1), that is

${Y}_{t}|{\mathcal{F}}_{t-1}\sim AHP\left({\theta }_{t}, \gamma \right),$

$E\left[{Y}_{t}|{\mathcal{F}}_{t-1}\right] = \frac{{\theta }_{t}}{\gamma } = {\alpha }_{0}+\sum \limits_{i = 1}^{p}{\alpha }_{i}{Y}_{t-i}+\sum \limits_{j = 1}^{q}{\beta }_{j}\frac{{\theta }_{t-j}}{\gamma }.$

Let $\boldsymbol{\alpha } = {\left({\alpha }_{1}, \dots, {\alpha }_{p}\right)}^{T}, \boldsymbol{\beta } = {\left({\beta }_{1}, \dots, {\beta }_{q}\right)}^{T},$ ${\boldsymbol{\lambda }}^{*} = {\left({\alpha }_{0}, {\boldsymbol{\alpha }}^{T}, {\boldsymbol{\beta }}^{T}\right)}^{T},$ $\boldsymbol{\lambda } = {\left(\gamma, {\boldsymbol{\lambda }}^{*{\rm T}}\right)}^{T},$ and write the true value of $\boldsymbol{\lambda }$ as ${\boldsymbol{\lambda }}^{0}$ . Suppose that the observation $\mathbf{Y} = \left({Y}_{1}, \dots, {Y}_{n}\right)$ is fitted with the model (3.1). The conditional likelihood function at time t is

${l}_{t}\left(\boldsymbol{\lambda }\right) = f\left({y}_{t}\left|{\mathcal{F}}_{t-1};\right.\boldsymbol{\lambda }\right) = \frac{{\theta }_{t}^{{Y}_{t}}}{{\left(\gamma \right)}_{{Y}_{t}}}\phi \left(1+{Y}_{t};\gamma +{Y}_{t};-{\theta }_{t}\right),$

where ${\theta }_{t} = \gamma {\alpha }_{0}+\gamma {\sum }_{i = 1}^{p}{\alpha }_{i}{Y}_{t-i}+{\sum }_{j = 1}^{q}{\beta }_{j}{\theta }_{t-j}$ . The conditional log-likelihood function is

$l\left(\boldsymbol{\lambda }\right) = \mathrm{ln}\prod\limits _{t = 2}^{n}{l}_{t}\left(\boldsymbol{\lambda }\right) = \sum\limits _{t = 2}^{n}\left\{{Y}_{t}\mathrm{ln}{\theta }_{t}-\mathrm{ln}\mathrm{\Gamma }\left(\gamma +{Y}_{t}\right)+\mathrm{ln}\mathrm{\Gamma }\left(\gamma \right)+h\left({\theta }_{t}\right)\right\},$

where $h\left({\theta }_{t}\right) = \mathrm{ln}\left[\phi \left(1+{Y}_{t}; \gamma +{Y}_{t}; -{\theta }_{t}\right)\right]$ . The score function is defined by

$\frac{\partial l\left(\boldsymbol{\lambda }\right)}{\partial \boldsymbol{\lambda }} = \sum\limits _{t = 2}^{n}\frac{\partial }{\partial \boldsymbol{\lambda }}\mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right),$

with

$\frac{\partial \mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right)}{\partial \gamma } = \frac{{Y}_{t}}{{\theta }_{t}}\frac{\partial {\theta }_{t}}{\partial \gamma }-\psi \left(\gamma +{Y}_{t}\right)+\psi \left(\gamma \right)+\frac{\partial }{\partial \gamma }h\left({\theta }_{t}\right) ,$

$\frac{\partial \mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right)}{\partial {\boldsymbol{\lambda }}^{*}} = \left[\frac{{Y}_{t}}{{\theta }_{t}}+\frac{\partial }{\partial {\theta }_{t}}h\left({\theta }_{t}\right)\right]\frac{\partial {\theta }_{t}}{\partial {\boldsymbol{\lambda }}^{*}} ,$

$\frac{{\partial \theta }_{t}}{\partial \gamma } = {\alpha }_{0}+{\sum }_{i = 1}^{p}{\alpha }_{i}{Y}_{t-i}+{\sum }_{j = 1}^{q}{\beta }_{j}\frac{{\partial \theta }_{t-j}}{\partial \gamma } , \frac{\partial {\theta }_{t}}{\partial {\alpha }_{0}} = \gamma +{\sum }_{j = 1}^{q}{\beta }_{j}\frac{\partial {\theta }_{t-j}}{\partial {\alpha }_{0}} ,$

$\frac{\partial {\theta }_{t}}{\partial {\alpha }_{i}} = \gamma {Y}_{t-i}+{\sum }_{j = 1}^{q}{\beta }_{j}\frac{\partial {\theta }_{t-j}}{\partial {\alpha }_{i}}, i = 1, \dots , p ,$

$\frac{\partial {\theta }_{t}}{\partial {\beta }_{j}} = {\theta }_{t-j}+{\sum }_{k = 1}^{q}{\beta }_{k}\frac{{\partial \theta }_{t-k}}{\partial {\beta }_{j}}, j = 1, \dots , q.$

where $\psi \left(x\right) = \frac{\partial }{\partial x}\left[\mathrm{ln}\mathrm{\Gamma }\left(x\right)\right] = \frac{1}{\mathrm{\Gamma }\left(x\right)}\frac{\partial }{\partial x}\left[\mathrm{\Gamma }\left(x\right)\right]$ is the digamma function (^[20]). The solution of the equation $\frac{\partial l\left(\boldsymbol{\lambda }\right)}{\partial \boldsymbol{\lambda }} = 0$ , if it exists, gives the conditional MLE of $\boldsymbol{\lambda }$ , denoted by $\widehat{\boldsymbol{\lambda }}$ .

The Hessian matrix is given by

${H}_{n}\left(\boldsymbol{\lambda }\right) = -\sum \limits_{t = 2}^{n}\frac{{\partial }^{2}}{\partial \boldsymbol{\lambda }\partial {\boldsymbol{\lambda }}^{T}}\mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right)$

with

$\frac{{\partial }^{2}\mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right)}{\partial {\gamma }^{2}} = \frac{{Y}_{t}}{{\theta }_{t}}\frac{{\partial }^{2}{\theta }_{t}}{\partial {\gamma }^{2}}-\frac{{Y}_{t}}{{\theta }_{t}^{2}}{\left(\frac{\partial {\theta }_{t}}{\partial \gamma }\right)}^{2}-{\psi }^{\text{'}}\left(\gamma +{y}_{t}\right)+{\psi }^{\text{'}}\left(\gamma \right)+\frac{{\partial }^{2}}{\partial {\gamma }^{2}}h\left({\theta }_{t}\right) ,$

$\frac{{\partial }^{2}\mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right)}{\partial \gamma \partial {\boldsymbol{\lambda }}^{\mathbf{*}}} = \left[-\frac{{Y}_{t}}{{\theta }_{t}^{2}}\frac{\partial {\theta }_{t}}{\partial \gamma }+\frac{{\partial }^{2}}{\partial \gamma \partial {\theta }_{t}}h\left({\theta }_{t}\right)\right]\frac{\partial {\theta }_{t}}{\partial {\boldsymbol{\lambda }}^{*}}+\left[\frac{{Y}_{t}}{{\theta }_{t}}+\frac{\partial }{\partial {\theta }_{t}}h\left({\theta }_{t}\right)\right]\frac{{\partial }^{2}{\theta }_{t}}{\partial \gamma \partial {\boldsymbol{\lambda }}^{*}} ,$

$\frac{{\partial }^{2}\mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right)}{\partial {\boldsymbol{\lambda }}^{*}\partial {\boldsymbol{\lambda }}^{*T}} = \left[-\frac{{Y}_{t}}{{\theta }_{t}^{2}}+\frac{{\partial }^{2}}{\partial {\theta }_{t}^{2}}h\left({\theta }_{t}\right)\right]\frac{\partial {\theta }_{t}}{\partial {\boldsymbol{\lambda }}^{*}}\frac{\partial {\theta }_{t}}{\partial {\boldsymbol{\lambda }}^{*T}}+\left[\frac{{Y}_{t}}{{\theta }_{t}}+\frac{\partial }{\partial {\theta }_{t}}h\left({\theta }_{t}\right)\right]\frac{{\partial }^{2}{\theta }_{t}}{\partial {\boldsymbol{\lambda }}^{*}\partial {\boldsymbol{\lambda }}^{*T}} ,$

$\frac{{\partial }^{2}{\theta }_{t}}{\partial {\alpha }_{0}^{2}} = 0, \frac{{\partial }^{2}{\theta }_{t}}{\partial {\alpha }_{i}^{2}} = 0, \frac{{\partial }^{2}{\theta }_{t}}{\partial {\alpha }_{0}\partial {\alpha }_{i}} = 0, i = 1, \dots , p,$

$\frac{{\partial }^{2}{\theta }_{t}}{\partial {\alpha }_{0}\partial {\beta }_{j}} = \frac{\partial {\theta }_{t-j}}{\partial {\alpha }_{0}}+{\sum }_{k = 1}^{q}{\beta }_{k}\frac{{\partial }^{2}{\theta }_{t-k}}{\partial {\alpha }_{0}\partial {\beta }_{j}}, \frac{{\partial }^{2}{\theta }_{t}}{\partial {\beta }_{j}^{2}} = 2\frac{\partial {\theta }_{t-j}}{\partial {\beta }_{j}}+{\sum }_{k = 1}^{q}{\beta }_{k}\frac{{\partial }^{2}{\theta }_{t-k}}{\partial {\beta }_{j}^{2}}, j = 1, \dots , q,$

$\frac{{\partial }^{2}{\theta }_{t}}{\partial {\alpha }_{i}\partial {\beta }_{j}} = \frac{\partial {\theta }_{t-j}}{\partial {\alpha }_{i}}+{\sum }_{k = 1}^{q}{\beta }_{k}\frac{{\partial }^{2}{\theta }_{t-k}}{\partial {\alpha }_{i}\partial {\beta }_{j}}, i = 1, \dots , p, j = 1, \dots , q ,$

$\frac{{\partial }^{2}{\theta }_{t}}{\partial {\beta }_{j}^{}\partial {\beta }_{k}^{}} = \frac{\partial {\theta }_{t-j}}{\partial {\beta }_{k}}+\frac{\partial {\theta }_{t-k}}{\partial {\beta }_{k}}+{\sum }_{l = 1}^{q}{\beta }_{l}\frac{{\partial }^{2}{\theta }_{t-l}}{\partial {\beta }_{k}^{}\partial {\beta }_{j}^{}}, j, k = 1, \dots , q$

where ${\psi }^{{{'}}}\left(x\right) = \frac{\partial }{\partial x}\left[\psi \left(x\right)\right]$ is the trigamma function. According to White ^[35], the standard errors of $\widehat{\boldsymbol{\lambda }}$ can be computed from the robust sandwich matrix ${H}_{n}^{-1}\left(\widehat{\boldsymbol{\lambda }}\right){S}_{n}\left(\widehat{\boldsymbol{\lambda }}\right){H}_{n}^{-1}\left(\widehat{\boldsymbol{\lambda }}\right)$ , where ${S}_{n}\left(\boldsymbol{\lambda }\right) = \sum _{t = 2}^{n}\frac{\partial \mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right)}{\partial \boldsymbol{\lambda }}\frac{\partial \mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right)}{\partial {\boldsymbol{\lambda }}^{T}}$ , and ${H}_{n}\left(\boldsymbol{\lambda }\right) = -\sum _{t = 2}^{n}\frac{{\partial }^{2}\mathrm{ln}{l}_{t}\left(\boldsymbol{\lambda }\right)}{\partial \boldsymbol{\lambda }\partial {\boldsymbol{\lambda }}^{T}}$ .

5. Monte Carlo simulation study

Since the autocorrelation function of AHP-INGARCH(1, 1) model is derived, the parameter of AHP-INGARCH(1, 1) model can also be estimated according to the Yule-Walker approach. It is of interest to compare the efficiency of the Yule-Walker (YW) estimator with the maximum likelihood (ML) estimator. In this section, we conduct a Monte Carlo simulation study to investigate the performance of the YW and ML estimators for the proposed AHP-INGARCH(1, 1) model. The simulations are computed by using the R programming language. The data set ${Y}_{1}, {Y}_{2}, \dots, {Y}_{n}$ is generated in accordance to model (3.1) for different sample sizes $n$ and different parameter values. Due to space constraints, we showed only the simulation results for the data with fixed mean equal to one and parameters ${(\alpha }_{0} = 0.6, {\alpha }_{1} = 0.3, {\beta }_{1} = 0.1)$ for several sample sizes ( $n = 100, 200\;\; \mathrm{a}\mathrm{n}\mathrm{d}\;\; 500)$ with over-dispersion $\left(\gamma = 2\right)$ , equi-dispersion $\left(\gamma = 1\right)$ and under-dispersion $\left(\gamma = 0.8\right)$ in . Similar results are obtained for other sets of parameter values considered. The parameter estimates and mean square errors for the parameters ${(\alpha }_{0}, {\alpha }_{1}, {\beta }_{1}, \gamma)$ are computed over 1000 replications. In the MLE estimation, we use sample mean as the initial mean value, namely ${\mu }_{1} = \frac{1}{n}{\sum }_{i = 1}^{n}{Y}_{i}$ .

Table 1. Yule-Walker (YW) and maximum likelihood estimates (MLE) for simulated AHP-INGARCH(1, 1) model with

${(\alpha }_{0} = 0.6, {\alpha }_{1} = 0.3, {\beta }_{1} = 0.1, \gamma)$ and different sample sizes

$n$ over 1000 replications (Mean squared errors in parentheses).

Dispersion	n		Estimates
Dispersion	n		${\widehat{\alpha }}_{0}$	${\widehat{\alpha }}_{1}$	${\widehat{\beta }}_{1}$	$\widehat{\gamma }$
$\gamma =2$ (over-dispersion)	100	YW	0.6688 (0.1866)	0.2825 (0.0188)	0.0486 (0.2298)	2.0157 (1.3083)
		MLE	0.5699 (0.0387)	0.2854 (0.0113)	0.1471 (0.0411)	2.0355 (0.9536)
	200	YW	0.6686 (0.0913)	0.2957 (0.0177)	0.0339 (0.1070)	2.0541 (0.6345)
		MLE	0.5905 (0.0249)	0.2907 (0.0070)	0.1193 (0.0262)	2.0767 (0.5949)
	500	YW	0.6189 (0.0257)	0.2910 (0.0036)	0.0859 (0.0290)	2.0191 (0.1926)
		MLE	0.5953 (0.0121)	0.2931 (0.0026)	0.1083 (0.0128)	2.0237 (0.1751)
$\gamma =1$ (equi-dispersion)	100	YW	0.6755 (0.2987)	0.2842 (0.0386)	0.0327 (0.3618)	1.0375 (0.1024)
		MLE	0.5794 (0.0388)	0.2688 (0.0114)	0.1472 (0.0413)	1.0436 (0.0871)
	200	YW	0.6670 (0.0868)	0.2798 (0.0095)	0.0499 (0.0924)	1.0186 (0.0490)
		MLE	0.5932 (0.0263)	0.2763 (0.0059)	0.1278 (0.0281)	1.0194 (0.0354)
	500	YW	0.6159 (0.0246)	0.2926 (0.0028)	0.0925 (0.0287)	1.0097 (0.0149)
		MLE	0.5947 (0.0130)	0.2923 (0.0024)	0.1145 (0.0145)	1.0130 (0.0128)
$\gamma =0.8$ (under-dispersion)	100	YW	0.6566 (0.1908)	0.2841 (0.0255)	0.0358 (0.2377)	0.8560 (0.0614)
		MLE	0.5611 (0.0353)	0.2759 (0.0107)	0.1418 (0.0379)	0.8513 (0.0387)
	200	YW	0.6418 (0.0710)	0.2828 (0.0125)	0.0510 (0.0877)	0.8205 (0.0192)
		MLE	0.5801 (0.0230)	0.2776 (0.0055)	0.1212 (0.0254)	0.8284 (0.0166)
	500	YW	0.6121 (0.0228)	0.2878 (0.0023)	0.0837 (0.0281)	0.8165 (0.0076)
		MLE	0.5892 (0.0110)	0.2880 (0.0021)	0.1073 (0.0128)	0.8185 (0.0064)

| Show Table

DownLoad: CSV

Based on the results from the simulation study, the parameter estimates approach the true values along with small mean square errors at all three types of dispersions as the sample size increases.

6. Comparison of INGARCH models

This section describes the analysis of three real-life data sets fitted by the Poisson, NB, DP, GP, COMP, and AHP distributed INGARCH-type modes. Based on the sample mean and variance, the first data set exhibits over-dispersion and the last two data sets exhibit under-dispersion. The MLE results are obtained using the R software through the Box constraints optimization (L-BFGS-B) function with the initial conditional mean equals to the sample mean. Statistical consistency between the predictive and observed distributions is investigated by plotting the histogram of probability integral transform (PIT) (see, Dawid ^[10]). According to Czado et al. ^[9], from the PIT histogram, under-dispersed predictive distribution is signified by a U-shaped histogram whereas over-dispersed predictive distribution is indicated by a hump or inverse-U shaped histogram. When central tendencies are biased, a skewed histogram is observed.

6.1. Polio series

The polio data is a time series of length 168, which has been fitted by Zhu ^[39] using the NB-INGARCH model. The data series is the monthly number of cases of poliomyelitis reported by the U.S. Centers for Disease Control from 1970 to 1983 with a sample mean of 1.33 and a sample variance of 3.50. These statistics indicate that the series is over-dispersed and the sample first-order autocorrelation coefficient (FOAC), $\widehat{\rho }\left(1\right) = 0.2948$ .

Figure 3 describes the original data, autocorrelation function (ACF) and partial autocorrelation function (PACF) of the polio series while Table 2 summarizes the result of six fitted INGARCH(1, 1) models.

Figure 3. Plots of time series, ACF and PACF of polio cases from 1970 to 1983.

DownLoad: Full-Size Img PowerPoint

Table 2. Polio series: Parameter estimates with Poisson-, NB-, DP-, GP-, COMP-, and AHP-INGARCH(1, 1) models, standard errors are shown in parentheses.

Model	${\widehat{\alpha }}_{0}$	${\widehat{\alpha }}_{1}$	${\widehat{\beta }}_{1}$	$\widehat{r}/\widehat{\gamma }/\widehat{\phi }/\widehat{\upsilon }/\widehat{\gamma }$	AIC	BIC
Poisson	0.6357 (0.1702)	0.3515 (0.0678)	0.1846 (0.1342)		562.08	571.40
NB	0.6075 (0.2275)	0.3643 (0.1029)	0.1982 (0.1858)	1.6346 (0.4326)	520.47	532.87
DP	0.6357 (0.2278)	0.3515 (0.0907)	0.1846 (0.1796)	0.5585 (0.0611)	529.33	541.73
GP	0.3645 (0.4105)	0.1647 (0.0859)	0.5689 (0.3497)	1.4089 (0.1083	528.08	540.48
COMP	0.0529 (0.0399)	0.1845 (0.0713)	0.1670 (0.1896)	0.2546 (0.0524)	524.37	536.77
AHP	0.6418 (0.2063)	0.4214 (0.1082)	0.1344 (0.1536)	4.1310 (2.0243)	521.15	533.55

| Show Table

DownLoad: CSV

Based on the Akaike information criterion (AIC) and Bayesian information criterion (BIC) for all the models, the AHP-INGARCH(1, 1) model is found to be comparable to the NB-INGARCH(1, 1) but better than the Poisson, DP-, GP- and COMP-INGARCH(1, 1) models for this over-dispersed polio dataset. It is worth noting that the values of the parameter estimates for NB-INGARCH(1, 1) model are different from Zhu ^[39] because they fixed the parameter $r = 2$ .

Figure 4 shows the PIT histogram of the six INGARCH(1, 1) models. Overall, Poisson-INGARCH(1, 1) model have a U-shaped histogram while DP-INGARCH(1, 1) model has a skewed histogram. Although not uniform, the NB-, GP-, COMP- and AHP-INGARCH(1, 1) have only a slight hump in the histogram indicating better performance than the Poisson- and DP-INGARCH(1, 1) models.

Figure 4. PIT histogram for Poisson-, NB-, DP-, GP-, COMP- and AHP-INGARCH (1, 1) models of polio series.

DownLoad: Full-Size Img PowerPoint

Comparison of the estimated means, variances, dispersion index and FOACs within the fitted Poisson, NB, DP, GP, COMP and AHP models for polio data and another two under-dispersion data sets that will be discussed later in the next sections are summarized in Table 3. The dispersion index for NB- and AHP-INGARCH(1, 1) models are closest to the dispersion index for the polio data compared to the Poisson-, DP-, GP- and COMP-INGARCH(1, 1) models.

Table 3. Sample and estimated mean, variance, dispersion index (

${I}_{Z}$ ) and FOAC under the Poisson, NB, DP, GP, COMP and AHP models.

Data	Model	Sample	Poisson	NB	DP	GP	COMP	AHP
Polio	Mean	1.3333	1.3701	1.3888	1.3701	1.3683	1.9624	1.4449
	Variance	3.5050	1.6076	3.4813	2.8786	2.8756	2.0349	4.0534
	${I}_{Z}$	2.6288	1.1733	2.5067	2.1010	2.1016	1.0370	2.8053
	FOAC	0.2948	0.3787	0.3966	0.3787	0.1963	0.1908	0.4489
IP	Mean	1.2863	1.2917	1.2916	1.2917	1.2917	1.3154	1.2913
	Variance	1.2052	1.4039	1.4039	1.4589	1.1777	1.2184	1.2806
	${I}_{Z}$	0.9370	1.0869	1.0869	1.1294	0.9117	0.9263	0.9917
	FOAC	0.2925	0.2827	0.6927	0.2827	0.2834	0.2825	0.2830
COVID-19	Mean	0.8416	0.5032	0.5045	0.5045	0.4001	0.7396	0.6022
	Variance	0.7347	0.5814	0.5829	0.4981	0.3315	0.5985	0.5547
	${I}_{Z}$	0.8729	1.1555	1.1554	0.9873	0.8285	0.8091	0.9212
	FOAC	0.2374	0.2082	0.2081	0.2081	0.1891	0.2383	0.2993

| Show Table

DownLoad: CSV

6.2. Internet Protocol (IP) count series

The IP counts data is a time series of length 241 that gives the number of different IP addresses registered within periods of 2-min length at the server of the Department of Statistics of the University of W $\ddot{u}$ rzburg in 29 November 2005 between 10 a.m and 6 p.m. This data set has been investigated by Weiß ^[30] and Zhu ^[40,41]. The data examined is under-dispersed since the variance (1.2052) is smaller than the mean (1.2863) with the sample FOAC of 0.2925. Figure 5 presents the IP counts time series, ACF and PACF, respectively.

Figure 5. Plots of time series, ACF and PACF of IP counts series.

DownLoad: Full-Size Img PowerPoint

Zhu ^[41] fitted the Poisson INAR(1), the Poisson-, DP- and GP- INARCH(1) and INGARCH(1, 1) models to the data and claimed that the GP-INARCH(1) model is more appropriate for this time series. Comparison of existing INARCH(1) and AHP-INARCH(1) models are summarized in . It is worth noting that the over-dispersed NB-INARCH(1) model of Zhu ^[39] is added in our analysis for comparison. As seen from , the estimated value of the NB-INARCH(1) parameter $\widehat{r}$ is large and significant, but unreliable. This is because the differentiation of the likelihood function with respect to the parameter $r$ is problematic as reported in Zhu ^[39]. We obtain the same conclusion as Zhu ^[40], that is, (ⅰ) the estimated dispersion index of GP-, COMP- and AHP-INARCH(1) models are less than 1, which indicate that the data set is under-dispersed count time series, and Poisson-, DP- and NB- INARCH(1) models wrongly indicate that the data set is over-dispersed; (ⅱ) GP model performs better in AIC and Poisson INARCH(1) performs better in BIC, but the difference of AIC and BIC values between other models are rather small. Typically, a difference in AIC value less than 2 is considered not significant (Zhu ^[40]). Based on Figure 6, the PIT histogram of the AHP, GP and COMP models shows approximate uniformity, while Poisson-, NB- and DP-INARCH(1) are slightly hump-shaped.

Table 4. IP counts series: Parameter estimates with Poisson-, NB-, DP-, GP-, COMP- and AHP-INARCH(1) models, standard errors are shown in parentheses.

Model	${\widehat{\alpha }}_{0}$	${\widehat{\alpha }}_{1}$	$\widehat{r}/\widehat{\gamma }/\widehat{\phi }/\widehat{\upsilon }/\widehat{\gamma }$	AIC	BIC
Poisson	0.9265 (0.1007)	0.2827 (0.0684)		675.20	682.15
NB	0.9264 (0.1007)	0.2828 (0.0684)	39722.23 (253.85)	677.20	687.62
DP	0.9265 (0.1026)	0.2827 (0.0697)	0.9623 (0.0878)	677.02	687.44
GP	0.9256 (0.0917)	0.2834 (0.0622)	0.9157 (0.0409)	673.78	684.19
COMP	1.0492 (0.1157)	0.2825 (0.0636)	1.2674 (0.1790)	674.81	685.22
AHP	0.9258 (0.0974)	0.2830 (0.0653)	0.8796 (0.0937)	675.90	686.32

| Show Table

DownLoad: CSV

Figure 6. PIT histogram for Poisson-, NB-, DP-, GP-, COMP- and AHP-INARCH (1) models of IP counts series.

DownLoad: Full-Size Img PowerPoint

6.3. COVID-19 series

The COVID-19 data is a time series of length 101 that gives the number of daily COVID-19 new deaths recorded in Saudi Arabia between 26^th January 2023 to 6^th May 2023. This data is publicly available at the website https://ourworldindata.org/. The data examined is under-dispersed since the variance (0.7347) is smaller than the mean (0.8416) with the sample FOAC of 0.2374. Figure 7 presents the COVID-19 new deaths time series, ACF and PACF, respectively.

Figure 7. Plots of time series, ACF and PACF of daily COVID-19 new deaths.

DownLoad: Full-Size Img PowerPoint

summarizes the results of six INGARCH(1, 1) models for the COVID-19 new deaths data. Based on AIC and BIC, the AHP-INGARCH(1-1) model is the best. The estimated value of the NB-INGARCH(1, 1) parameter $\widehat{r}$ is again large and significant, but unreliable. Besides the over-dispersed Poisson- and NB-INGARCH(1, 1) models, the dispersion index of DP-, GP-, COMP- and AHP-INGARCH(1, 1) are all less than 1 which indicate that these models are capable of capturing under-dispersed feature from the empirical data (see Table 3). All the PIT histrograms in Figure 8 visibly deviate from uniformity (with a slight preference for the AHP-INGARCH(1, 1) model). In particular, the PIT histrograms of Poisson-, NB- and DP-INGARCH(1, 1) models are slightly hump-shaped while GP-, COMP- and AHP-INGARCH(1, 1) models are slightly U-shaped.

Table 5. Daily COVID-19 new deaths: Parameter estimates with Poisson-, NB-, DP-, GP-, COMP-, and AHP-INGARCH(1, 1) models, standard errors are shown in parentheses.

Model	${\widehat{\alpha }}_{0}$	${\widehat{\alpha }}_{1}$	${\widehat{\beta }}_{1}$	$\widehat{r}/\widehat{\gamma }/\widehat{\phi }/\widehat{\upsilon }/\widehat{\gamma }$	AIC	BIC
Poisson	0.0130 (0.0268)	0.0890 (0.0472)	0.8851 (0.0645)		223.01	230.76
NB	0.0131 (0.0268)	0.0891 (0.0472)	0.8850 (0.0646)	16860.79 (267.79)	225.01	235.31
DP	0.0131 (0.0248)	0.0891 (0.0436)	0.8850 (0.0597)	1.1702 (0.1655)	223.83	234.13
GP	0.0075 (0.0225)	0.0736 (0.0360)	0.9077 (0.0537)	0.8503 (0.0502)	219.61	229.91
COMP	0.0520 (0.0413)	0.1012 (0.0423)	0.8710 (0.0580)	1.9497 (0.4518)	219.35	229.64
AHP	0.0172 (0.0265)	0.1220 (0.0501)	0.8493 (0.0639)	0.4912 (0.1070)	217.55	227.85

| Show Table

DownLoad: CSV

Figure 8. PIT histogram for Poisson-, NB-, DP-, GP-, COMP- and AHP-INARCH(1) models of daily COVID-19 new deaths series.

DownLoad: Full-Size Img PowerPoint

7. Conclusions

This work introduces a new family of AHP-INGARCH models for analysing over-dispersed and under-dispersed count time series data. The advantages of the AHP-INGARCH-type model are (ⅰ) the ease in obtaining the model mean and variance compared to distributions like the COMP and DP distributions, making the AHP distribution a useful choice for an INGARCH model, and (ⅱ) the ability to accommodate a wider range of dispersion, placing the AHP distribution as a flexible model for applications. The application of the AHP-INGARCH models to three real-life data sets clearly demonstrates the model competitiveness in studying both over-dispersed and under-dispersed data. Since $\gamma$ is the dispersion parameter of AHP distribution, one of the potential future research is to relax the assumption of constant dispersion by allowing time-varying dispersion. Another direction of future research is to extend the results to multivariate cases.

Use of AI tools declaration

The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

Acknowledgements

The authors would like to thank the editor and the referees for their valuable comments.

Seng Huat Ong is supported by the Ministry of Higher Education grant FRGS/1/2020/STG06/SYUC/02/1 and UCSI University grant REIG-FBM-2022/050. The authors would like to thank INTI International University for funding the publication of this work.

Conflict of interest

All authors declare no conflicts of interest in this paper.

Appendix A

A.1. Parameter constraint of AHP distribution

Let $Z\sim AHP\left(\theta, \gamma \right)$ . The probability mass function of $Z$ can be written as

$P\left(Z = z\right) = \frac{{\theta }^{z}}{{\left(\gamma \right)}_{z}}\phi \left(1+z;\gamma +z;-\theta \right) = \frac{{\theta }^{z}{e}^{-\theta }}{{\left(\gamma \right)}_{z}}\phi \left(\gamma -1;\gamma +z;\theta \right)$

where the second equality is obtained from Johnson et al. ^[20]. Since $\frac{{\theta }^{z}{e}^{-\theta }}{{\left(\gamma \right)}_{z}} > 0$ for $\theta > 0$ and $\gamma > 0$ . The positivity of the pmf of $Z$ depends on the confluent hypergeometric function

$\phi \left(\gamma -1;\gamma +z;\theta \right) = \sum \limits_{k = 0}^{\infty }\frac{{\left(\gamma -1\right)}_{k}{\theta }^{k}}{{\left(\gamma +z\right)}_{k}k!}.$

It is easy to see that $\phi \left(\gamma -1;\gamma +z; \theta \right) > 0$ when $\gamma \ge 1$ . Hence, the positivity of the pmf of $Z$ is guaranteed for $\gamma \ge 1$ .

For $0 < \gamma < 1$ , we note that ${\left(\gamma +z\right)}_{k}\ge {\left(\gamma \right)}_{k}$ for all $z\ge 0$ and $k\ge 0.$ Since

$\phi \left(\gamma -1;\gamma +z;\theta \right) = \sum \limits_{k = 0}^{\infty }\frac{{\left(\gamma -1\right)}_{k}{\theta }^{k}}{{\left(\gamma +z\right)}_{k}k!}\ge \sum \limits_{k = 0}^{\infty }\frac{{\left(\gamma -1\right)}_{k}{\theta }^{k}}{{\left(\gamma \right)}_{k}k!} = \phi \left(\gamma -1;\gamma ;\theta \right),$

we have

$P\left(Z = z\right) = \frac{{\theta }^{z}{e}^{-\theta }}{{\left(\gamma \right)}_{z}}\phi \left(\gamma -1;\gamma +z;\theta \right)\ge P\left(Z = 0\right) = {e}^{-\theta }\phi \left(\gamma -1;\gamma ;\theta \right)$

for all $z\ge 1$ . Thus, for $0 < \gamma < 1$ , the positivity of the pmf of $Z$ is guaranteed as long as $P\left(Z = 0\right) > 0$ . This condition can be further simplified as

$\phi \left(\gamma -1;\gamma ;\theta \right) = \sum \limits_{k = 0}^{\infty }\frac{{\left(\gamma -1\right)}_{k}{\theta }^{k}}{{\left(\gamma \right)}_{k}k!} = \sum \limits_{k = 0}^{\infty }\frac{\mathrm{\Gamma }\left(\gamma -1+k\right)\mathrm{\Gamma }\left(\gamma \right){\theta }^{k}}{\mathrm{\Gamma }\left(\gamma -1\right)\mathrm{\Gamma }\left(\gamma +k\right)k!}$

$= \left(\gamma -1\right)\sum \limits_{k = 0}^{\infty }\frac{{\theta }^{k}}{\left(\gamma -1+k\right)k!} > 0$

It is worth to note that the confluent hypergeometric function $\phi \left(\gamma -1;\gamma; \theta \right)$ is a decreasing function on $\theta$ . Hence, for a given $\gamma \in \left(\mathrm{0, 1}\right)$ , $\phi \left(\gamma -1;\gamma; \theta \right) > 0$ iff $\theta < {\theta }^{*}$ , where ${\theta }^{*}$ is the solution of $\phi \left(\gamma -1;\gamma; {\theta }^{*}\right) = 0$ .

A.2. Proof of Theorem 3.1

(ⅰ) Let ${\mu }_{Y} = E\left[{Y}_{t}\right]$ denote the unconditional expectation of ${Y}_{t}$ if it exists. Then by the tower property of conditional expectation, we have

${\mu }_{Y} = E\left[E\left[{Y}_{t}|{\mathcal{F}}_{t-1}\right]\right] = {\alpha }_{0}+\sum \limits_{i = 1}^{p}{\alpha }_{i}E\left[{Y}_{t-i}\right]+\sum\limits_{j = 1}^{q}{\beta }_{j}E\left[{\mu }_{t-j}\right] = {\alpha }_{0}+\sum \limits_{i = 1}^{p}{\alpha }_{i}{\mu }_{Y}+\sum \limits_{j = 1}^{q}{\beta }_{j}{\mu }_{Y} = \\ \frac{{\alpha }_{0}}{\left(1-\sum _{i = 1}^{p}{\alpha }_{i}-\sum _{j = 1}^{q}{\beta }_{j}\right)},$

where $E\left[{\mu }_{t-j}\right] = E\left[E\left[{Y}_{t-j}|{\mathcal{F}}_{t-j-1}\right]\right] = {\mu }_{Y}$ by the tower property of conditional expectation.

(ⅱ) By the definition of covariance and the tower property, we have

$\begin{array}{cc}Cov\left[{Y}_{t}-{\mu }_{t}, {\mu }_{t-k}\right]& = E\left[E\left[\left({Y}_{t}-{\mu }_{t}\right)\left({\mu }_{t-k}-{\mu }_{Y}\right)|{\mathcal{F}}_{t-1}\right]\right]\\ & = E\left[\left({\mu }_{t-k}-{\mu }_{Y}\right)\bullet E\left[{Y}_{t}-{\mu }_{t}\left|{\mathcal{F}}_{t-1}\right.\right]\right]\\ \begin{array}{c}\\ \end{array}& \begin{array}{c} = E\left[\left({\mu }_{t-k}-{\mu }_{Y}\right)\bullet \left({\mu }_{Y}-{\mu }_{Y}\right)\right]\\ = 0\end{array}\end{array}$

for $k\ge 0$ , where ${\mu }_{t}$ is ${\mathcal{F}}_{t-1}$ measurable, i.e, $E\left[{\mu }_{t}|{\mathcal{F}}_{t-1}\right] = {\mu }_{t}.$ The proof is completed by noticing that

$Cov\left[{Y}_{t}-{\mu }_{t}, {\mu }_{t-k}\right] = Cov\left[{Y}_{t}, {\mu }_{t-k}\right]-Cov\left[{\mu }_{t}, {\mu }_{t-k}\right] = 0.$

For $k < 0$ , by the same argument, we have

$\begin{array}{cc}Cov\left[{Y}_{t-k}-{\mu }_{t-k}, {Y}_{t}\right]& = E\left[E\left[\left({Y}_{t-k}-{\mu }_{t-k}\right)\left({Y}_{t}-{\mu }_{Y}\right)\left|{\mathcal{F}}_{t-1}\right.\right]\right]\\ & = E\left[\left({Y}_{t}-{\mu }_{Y}\right)\bullet E\left[{Y}_{t-k}-{\mu }_{t-k}\left|{\mathcal{F}}_{t-k-1}\right.\right]\right]\\ \begin{array}{c}\\ \end{array}& \begin{array}{c} = E\left[\left({Y}_{t}-\mu \right)\bullet \left({\mu }_{t-k}-{\mu }_{t-k}\right)\right]\\ = 0\end{array}\end{array}$

Again, we use the property that ${\mu }_{t-k}$ is ${\mathcal{F}}_{t-k-1}$ measurable, i.e., $E\left[{\mu }_{t-k}|{\mathcal{F}}_{t-k-1}\right] = {\mu }_{t-k}$ and $E\left[{Y}_{t-k}\left|{\mathcal{F}}_{t-k-1}\right.\right] = {\mu }_{t-k}$ . The proof is completed by noticing that

$Cov\left[{Y}_{t-k}-{\mu }_{t-k}, {Y}_{t}\right] = Cov\left[{Y}_{t-k}, {Y}_{t}\right]-Cov\left[{\mu }_{t-k}, {Y}_{t}\right] = 0.$

(ⅲ) Finally, applying part (ⅱ), we have for $k\ge 0,$

$\begin{array}{cc}{\gamma }_{\mu }\left(k\right)& = Cov\left[{\mu }_{t}, {\mu }_{t-k}\right] = Cov\left[\left({\alpha }_{0}+\sum\limits_{i = 1}^{p}{\alpha }_{i}{Y}_{t-i}+\sum\limits_{j = 1}^{q}{\beta }_{j}{\mu }_{t-j}\right), {\mu }_{t-k}\right]\\ & = \sum\limits_{i = 1}^{p}{\alpha }_{i}\bullet Cov\left[{Y}_{t-i}, {\mu }_{t-k}\right]+\sum\limits_{j = 1}^{q}{\beta }_{j}\bullet Cov\left[{\mu }_{t-j}, {\mu }_{t-k}\right]\\ \begin{array}{c}\\ \end{array}& \begin{array}{c} = \sum\limits_{i = 1}^{min\left(k, q\right)}{\alpha }_{i}\bullet Cov\left[{\mu }_{t-i}, {\mu }_{t-k}\right]+\sum\limits_{i = k+1}^{p}{\alpha }_{i}\bullet Cov\left[{Y}_{t-i}, {Y}_{t-k}\right]+\sum\limits_{j = 1}^{q}{\beta }_{j}{\gamma }_{\mu }\left(\left|k-j\right|\right)\\ = \sum\limits_{i = 1}^{min\left(k, q\right)}{\alpha }_{i}\bullet {\gamma }_{\mu }\left(\left|k-i\right|\right)+\sum\limits_{i = k+1}^{p}{\alpha }_{i}\bullet {\gamma }_{Y}\left(i-k\right)+\sum\limits_{j = 1}^{q}{\beta }_{j}{\gamma }_{\mu }\left(\left|k-j\right|\right).\end{array}\end{array}$

Again, applying part (ⅱ), we obtain for $k\ge 1$ ,

$\begin{array}{cc}{\gamma }_{Y}\left(k\right)& = Cov\left[{Y}_{t}, {Y}_{t-k}\right] = Cov\left[{\mu }_{t}, {Y}_{t-k}\right] = Cov\left[\left({\alpha }_{0}+\sum\limits_{i = 1}^{p}{\alpha }_{i}{Y}_{t-i}+\sum\limits_{j = 1}^{q}{\beta }_{j}{\mu }_{t-j}\right), {Y}_{t-k}\right]\\ & = \sum\limits_{i = 1}^{p}{\alpha }_{i}\bullet Cov\left[{Y}_{t-i}, {Y}_{t-k}\right]+\sum\limits_{j = 1}^{q}{\beta }_{j}\bullet Cov\left[{\mu }_{t-j}, {Y}_{t-k}\right]\\ \begin{array}{c}\\ \end{array}& \begin{array}{c} = \sum\limits_{i = 1}^{p}{\alpha }_{i}\bullet {\gamma }_{Y}\left(\left|k-i\right|\right)+\sum\limits_{j = 1}^{min\left(k-1, q\right)}{\beta }_{j}\bullet Cov\left[{Y}_{t-j}, {Y}_{t-k}\right]+\sum\limits_{j = k}^{q}{\beta }_{j}\bullet Cov\left[{\mu }_{t-j}, {\mu }_{t-k}\right]\\ = \sum\limits_{i = 1}^{p}{\alpha }_{i}{\gamma }_{Y}\left(\left|k-i\right|\right)+\sum\limits_{j = 1}^{min\left(k-1, q\right)}{\beta }_{j}{\gamma }_{Y}\left(k-j\right)+\sum\limits_{j = k}^{q}{\beta }_{j}{\gamma }_{\mu }\left(j-k\right).\end{array}\end{array}$

A.3. Proof of Special Case AHP-INGARCH(1, 1)

(ⅰ) Unconditional variance

From Eq (3.2), we obtain for $k\ge 2$ that

${\gamma }_{Y}\left(k\right) = {\alpha }_{1}{\gamma }_{Y}\left(k-1\right)+{\beta }_{1}{\gamma }_{Y}\left(k-1\right) = {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k-1}{\gamma }_{Y}\left(1\right).$ (*)

Again, from Eq (3.2), for $k = 1$ , we have

${\gamma }_{Y}\left(1\right) = {\alpha }_{1}{\gamma }_{Y}\left(0\right)+{\beta }_{1}{\gamma }_{\mu }\left(0\right) = {\alpha }_{1}V\left({Y}_{t}\right)+{\beta }_{1}V\left({\mu }_{t}\right)$

$= {\alpha }_{1}\left(E\left[V\left({Y}_{t}|{\mathcal{F}}_{t-1}\right)\right]+\mathrm{V}\left(E\left[{Y}_{t}|{\mathcal{F}}_{t-1}\right]\right)\right)+{\beta }_{1}V\left({\mu }_{t}\right) \\ = {\alpha }_{1}\left(E\left[{\mu }_{t}\left(1+{\mu }_{t}\left(\frac{\gamma -1}{\gamma +1}\right)\right)\right]+V\left({\mu }_{t}\right)\right)+{\beta }_{1}V\left({\mu }_{t}\right)$

$= {\alpha }_{1}\left(E\left({\mu }_{t}\right)+E\left[{\mu }_{t}^{2}\right]\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}\right)+\left({\alpha }_{1}+{\beta }_{1}\right)V\left({\mu }_{t}\right)$

$= {\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}\left[V\left({\mu }_{t}\right)+{\mu }^{2}\right]\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+\left({\alpha }_{1}+{\beta }_{1}\right)V\left({\mu }_{t}\right) \\ = {\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+\left({\alpha }_{1}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+{\alpha }_{1}+{\beta }_{1}\right)V\left({\mu }_{t}\right)$

$= {\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+\left(\frac{{2\alpha }_{1}\gamma +{\beta }_{1}\left(\gamma +1\right)}{\left(\gamma +1\right)}\right)V\left({\mu }_{t}\right)$

(**)

To determine an expression for $V\left({\mu }_{t}\right)$ , note first that for $k\ge 1$ , Eq (3.3) can be written as

${\gamma }_{\mu }\left(k\right) = {\alpha }_{1}{\gamma }_{\mu }\left(k-1\right)+{\beta }_{1}{\gamma }_{\mu }\left(k-1\right) = {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k}V\left({\mu }_{t}\right).$

For $k = 0$ , Eq (3.3) can be written as

${\gamma }_{\mu }\left(0\right) = V\left({\mu }_{t}\right) = {\alpha }_{1}{\gamma }_{Y}\left(1\right)+{\beta }_{1}{\gamma }_{\mu }\left(1\right)$

$= {\alpha }_{1}\left[{\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+\left({\alpha }_{1}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+{\alpha }_{1}+{\beta }_{1}\right)V\left({\mu }_{t}\right)\right]+{\beta }_{1}\left[\left({\alpha }_{1}+{\beta }_{1}\right)V\left({\mu }_{t}\right)\right] \\ = {\alpha }_{1}^{2}{\mu }_{Y}+{\alpha }_{1}^{2}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+\left[{\alpha }_{1}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+{\left({\alpha }_{1}+{\beta }_{1}\right)}^{2}\right]V\left({\mu }_{t}\right)$

$= \left({\alpha }_{1}^{2}{\mu }_{Y}+{\alpha }_{1}^{2}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}\right)/\left(1-{\alpha }_{1}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}-{\left({\alpha }_{1}+{\beta }_{1}\right)}^{2}\right)$

$= \frac{{\alpha }_{1}^{2}{\mu }_{Y}\left(\gamma +1\right)+{\alpha }_{1}^{2}{\mu }_{Y}^{2}\left(\gamma -1\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}.$

Therefore, the unconditional variance of ${Y}_{t}$ can be obtained from

$V\left({Y}_{t}\right) = E\left[V\left({Y}_{t}|{\mathcal{F}}_{t-1}\right)\right]+\mathrm{V}\left(E\left[{Y}_{t}|{\mathcal{F}}_{t-1}\right]\right) = E\left[{\mu }_{t}\left(1+{\mu }_{t}\left(\frac{\gamma -1}{\gamma +1}\right)\right)\right]+V\left({\mu }_{t}\right)\\ = {\mu }_{Y}+{\mu }_{Y}^{2}\frac{\gamma -1}{\gamma +1}+\left(\frac{2\gamma }{\gamma +1}\right)V\left({\mu }_{t}\right)\\ = \left({\mu }_{Y}+{\mu }_{Y}^{2}\frac{\gamma -1}{\gamma +1}\right)+\left(\frac{2\gamma }{\gamma +1}\right)\frac{{\alpha }_{1}^{2}{\mu }_{Y}\left(\gamma +1\right)+{\alpha }_{1}^{2}{\mu }_{Y}^{2}\left(\gamma -1\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}} \\= \left({\mu }_{Y}+{\mu }_{Y}^{2}\frac{\gamma -1}{\gamma +1}\right)+\frac{2\gamma {\alpha }_{1}^{2}\left({\mu }_{Y}+{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\gamma +1}\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}\\ = \left({\mu }_{Y}+{\mu }_{Y}^{2}\frac{\gamma -1}{\gamma +1}\right)\left(1+\frac{2\gamma {\alpha }_{1}^{2}}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}\right)\\ = \left({\mu }_{Y}+{\mu }_{Y}^{2}\frac{\gamma -1}{\gamma +1}\right)\left(\frac{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}\right).$

(ⅱ) Autocovariance function

Given that $V\left({\mu }_{t}\right) = \frac{{\alpha }_{1}^{2}{\mu }_{Y}\left(\gamma +1\right)+{\alpha }_{1}^{2}{\mu }_{Y}^{2}\left(\gamma -1\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}} = \frac{{\alpha }_{1}\left(\gamma +1\right)\left({\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}$ and from Eqs (*) and (**), we obtain for $k\ge 2$ that

${\gamma }_{Y}\left(k\right) = {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k-1}\left[{\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+\left(\frac{{2\alpha }_{1}\gamma +{\beta }_{1}\left(\gamma +1\right)}{\left(\gamma +1\right)}\right)V\left({\mu }_{t}\right)\right]$

$= {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k-1}\left[{\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}+\left(\frac{{2\alpha }_{1}\gamma +{\beta }_{1}\left(\gamma +1\right)}{\left(\gamma +1\right)}\right)\left(\frac{{\alpha }_{1}\left(\gamma +1\right)\left({\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}\right)\right]$

$= {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k-1}\left({\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}\right)\left[1+\frac{2{\gamma \alpha }_{1}^{2}+{{\alpha }_{1}\beta }_{1}\left(\gamma +1\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}\right]$

$= {\left({\alpha }_{1}+{\beta }_{1}\right)}^{k-1}\left({\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}\right)\left[\frac{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-{\alpha }_{1}{\beta }_{1}\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}\right].$

Note that for $k = 1$ , we have

${\gamma }_{Y}\left(1\right) = \left({\alpha }_{1}{\mu }_{Y}+{\alpha }_{1}{\mu }_{Y}^{2}\frac{\left(\gamma -1\right)}{\left(\gamma +1\right)}\right)\left[\frac{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-{\alpha }_{1}{\beta }_{1}\right)}{\left(\gamma +1\right)\left({1-\beta }_{1}^{2}-2{\alpha }_{1}{\beta }_{1}\right)-2{\gamma \alpha }_{1}^{2}}\right].$

References

[1]	J. Rafferty, C. D. Nugent, J. Liu, L. Chen, From activity recognition to intention recognition for assisted living within smart homes, IEEE Trans. Human Machine Syst., 47 (2017), 368–379. https://doi.org/10.1109/THMS.2016.2641388 doi: 10.1109/THMS.2016.2641388
[2]	Y. Sun, Z. Zhang, I Kakkos, G. K. Matsopoulos, J. J. Yuan, J. Suckling, Inferring the individual psychopathologic deficits with structural connectivity in a longitudinal cohort of Schizophrenia, IEEE J. Biomed. Health Informa., 26 (2022), 2536–2546. https://doi.org/10.1109/JBHI.2021.3139701 doi: 10.1109/JBHI.2021.3139701
[3]	Z. Guo, L. Zhao, J. Yuan, H. Yu, MSANet: Multiscale aggregation network integrating spatial and channel information for Lung nodule detection, IEEE J. Biomed. Health Inform., 26 (2022), 2547–2558. https://doi.org/10.1109/JBHI.2021.3131671 doi: 10.1109/JBHI.2021.3131671
[4]	J. W. Li, S. Barma, P. Un Mak, F. Chen, C. Li, M. T. Li, et al., Single-channel selection for EEG-based emotion recognition using brain rhythm sequencing, IEEE J. Biomed. Health Inform., 26 (2022), 2493–2503. https://doi.org/10.1109/JBHI.2022.3148109 doi: 10.1109/JBHI.2022.3148109
[5]	C. Finn, I. Goodfellow, S. Levine, Unsupervised learning for physical interaction through video prediction, arXiv: 1605.07157, 2016. https://doi.org/10.48550/arXiv.1605.07157
[6]	L. Liu, L. Cheng, Y. Liu, Y. Jia, D. S. Rosenblum, Recognizing complex activities by a probabilistic interval-based model, In: Proceedings of the thirtieth AAAI conference on artificial intelligence (AAAI'16), AAAI Press, 2016, 1266–1272. https://doi.org/10.5555/3015812.3015999
[7]	Z. Cao, T. Simon, S. E. Wei, Y. Sheikh, Realtime multi-person 2D pose estimation using part affinity fields, arXiv: 1611.08050, 2016. https://doi.org/10.48550/arXiv.1611.08050
[8]	R. Vemulapalli, F. Arrate, R. Chellappa, Human action recognition by representing 3D skeletons as points in a Lie group, In: 2014 IEEE Conference on computer vision and pattern recognition, 2014,588–595. https://doi.org/10.1109/CVPR.2014.82
[9]	K. Fragkiadaki, S. Levine, P. Felsen, J. Malik, Recurrent network models for human dynamics, In: IEEE International conference on computer vision (ICCV), 2015, 4346–4354. https://doi.org/10.1109/ICCV.2015.494
[10]	A. Jain, A. R. Zamir, S. Savarese, A. Saxena, Structural-RNN: Deep learning on spatio-temporal graphs, In: 2016 IEEE Conference on computer vision and pattern recognition (CVPR), 2016, 5308–5317. https://doi.org/10.1109/CVPR.2016.573
[11]	J. Redmon, A. Farhadi, YOLOv3: An incremental improvement, arXiv: 1804.02767, 2018. https://doi.org/10.48550/arXiv.1804.02767
[12]	K. Smagulova, A. P. James, A survey on LSTM memristive neural network architectures and applications, Eur. Phys. J. Spec. Top., 228 (2019), 2313–2324. https://doi.org/10.1140/epjst/e2019-900046-x doi: 10.1140/epjst/e2019-900046-x
[13]	J. Hu, Z. Fan, J. Liao, L. Liu, Predicting long-term skeletal motions by a spatio-temporal hierarchical recurrent network, arXiv: 1911.02404, 2019. https://doi.org/10.48550/arXiv.1911.02404
[14]	C. Li, P. Wang, S. Wang, Y. Hou, W. Li, Skeleton-based action recognition using LSTM and CNN, In: 2017 IEEE International conference on multimedia & expo workshops (ICMEW), 2017,585–590. https://doi.org/10.1109/ICMEW.2017.8026287
[15]	Q. Huang, L. Jia, G. Ren, X. Wang, C. Liu, Extraction of vascular wall in carotid ultrasound via a novel boundary-delineation network, Eng. Appl. Artif. Intell., 121 (2023), 106069. https://doi.org/10.1016/j.engappai.2023.106069 doi: 10.1016/j.engappai.2023.106069
[16]	J. Liu, Y. Wang, Y. Liu, S. Xiang, C. Pan, 3D PostureNet: A unified framework for skeleton-based posture recognition, Pattern Recognition Lett., 140 (2020), 143–149. https://doi.org/10.1016/j.patrec.2020.09.029 doi: 10.1016/j.patrec.2020.09.029
[17]	P. Wang, J. Wen, C. Si, Y. Qian, L. Wang, Contrast-reconstruction representation learning for self-supervised skeleton-based action recognition, IEEE Trans. Image Process., 31 (2022), 6224–6238. https://doi.org/10.1109/TIP.2022.3207577 doi: 10.1109/TIP.2022.3207577
[18]	A. Krizhevsky, I. Sutskever, G. E. Hinton, ImageNet classification with deep convolutional neural networks, Commun. ACM, 60 (2017), 84–90. https://doi.org/10.1145/3065386 doi: 10.1145/3065386
[19]	T. M. Taha, R. Hasan, C. Yakopcic, M. R. McLean, Exploring the design space of specialized multicore neural processors, In: 2013 International joint conference on neural networks (IJCNN), 2013, 1–8. https://doi.org/10.1109/IJCNN.2013.6707074
[20]	L. Chua, Memristor-The missing circuit element, IEEE Trans. Circuit Theory, 18 (1971), 507–519. https://doi.org/10.1109/TCT.1971.1083337 doi: 10.1109/TCT.1971.1083337
[21]	S. Wen, R. Hu, Y. Yang, T. Huang, Z. Zeng, Y. D. Song, Memristor-based echo state network with online least mean square, IEEE Trans. Syst. Man Cybernet., 49 (2019), 1787–1796. https://doi.org/10.1109/TSMC.2018.2825021 doi: 10.1109/TSMC.2018.2825021
[22]	S. H. Jo, K. H. Kim, W. Lu, High-density crossbar arrays based on a Si memristive system, Nano Lett., 9 (2009), 870–874. https://doi.org/10.1021/nl8037689 doi: 10.1021/nl8037689
[23]	R. Hasan, T. M. Taha, C. Yakopcic, On-chip training of memristor crossbar based multi-layer neural networks, Microelectronics J., 66 (2017), 31–40. https://doi.org/10.1016/j.mejo.2017.05.005 doi: 10.1016/j.mejo.2017.05.005
[24]	S. Wen, H. Wei, Y. Yang, Z. Guo, Z. Zeng, T. Huang, et al., Memristive LSTM network for sentiment analysis, IEEE Trans. Syst. Man Cybernet., 51 (2019), 1794–1804. https://doi.org/10.1109/TSMC.2019.2906098 doi: 10.1109/TSMC.2019.2906098
[25]	X. Liu, Z. Zeng, D. C. Wunsch, Memristor-based LSTM network with in situ training and its applications, Neural Netw. 131 (2020), 300–311. https://doi.org/10.1016/j.neunet.2020.07.035
[26]	C. Yakopcic, M. Z. Alom, T. M. Taha, Memristor crossbar deep network implementation based on a convolutional neural network, In: 2016 International joint conferenceon neural networks (IJCNN), IEEE, 2016,963–970. https://doi.org/10.1109/IJCNN.2016.7727302
[27]	C. Yakopcic, M. Z. Alom, T. M. Taha, Extremely parallel memristor crossbar architecture for convolutional neural network implementation, In: 2017 International joint conference on neural networks (IJCNN), IEEE, 2017, 1696–1703. https://doi.org/10.1109/IJCNN.2017.7966055
[28]	S. Wen, J. Chen, Y. Wu, Z. Yan, Y. Cao, Y. Yang, CKFO: Convolution kemel first operated algorithm with applications in memristor-based convolutional neural network, IEEE Trans. Comput. Design Integr. Circuits Syst., 40 (2020), 1640–1647. https://doi.org/10.1109/TCAD.2020.3019993 doi: 10.1109/TCAD.2020.3019993
[29]	P. Yao, H. Wu, B. Gao, J. Tang, Q. Zhang, W. Zhang, et al., Fully hardware-implemented memristor convolutional neural network, Nature, 577 (2020), 641–646. https://doi.org/10.1038/s41586-020-1942-4 doi: 10.1038/s41586-020-1942-4
[30]	C. Ionescu, D. Papava, V. Olaru, C. Sminchisescu, Human3.6M: Large scale datasets and predictive methods for 3d human sensing in natural environments, IEEE Trans. Pattern Anal. Machine Intell., 36 (2013), 1325–1339. https://doi.org/10.1109/TPAMI.2013.248 doi: 10.1109/TPAMI.2013.248
[31]	A. Shahroudy, J. Liu, T. T. Ng, G. Wang, NTU RGB+D: A large scale dataset for 3D human activity analysis, In: 2016 IEEE Conference on computer vision and pattern recognition (CVPR), IEEE, 2016, 1010–1019. https://doi.org/10.1109/CVPR.2016.115
[32]	Y. Du, W. Wang, L. Wang, Hierarchical recurrent neural network for skeleton based action recognition, In: 2015 IEEE Conference on computer vision and pattern recognition (CVPR), IEEE, 2015, 1110–1118. https://doi.org/10.1109/CVPR.2015.7298714
[33]	C. Li, Y. Hou, P. Wang, W. Li, Joint distance maps based action recognition with convolutional neural networks, IEEE Signal Process. Lett., 24 (2017), 624–628. https://doi.org/10.1109/LSP.2017.2678539 doi: 10.1109/LSP.2017.2678539
[34]	P. Wang, W. Li, C. Li, Y. Hou, Action recognition based on joint trajectory maps with convolutional neurall networks, Knowledge Based Syst., 158 (2018), 43–53. https://doi.org/10.1016/j.knosys.2018.05.029 doi: 10.1016/j.knosys.2018.05.029
[35]	A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, et al., PyTorch: An imperative style, high-performance deep learning library, In: Proceedings of the 33rd international conference on neural information processing systems, 2019, 8026–8037.
[36]	C. Lammie, W. Xiang, B. Linares-Barranco, M. R. Azghadi, MemTorch: An open-source simulation framework for memristive deep learning systems, Neurocomputing, 485 (2022), 124–133. https://doi.org/10.1016/j.neucom.2022.02.043 doi: 10.1016/j.neucom.2022.02.043
[37]	Hadiyawarman, F. Budiman, D. G. O. Hernowo, R. R. Pandey, H. Tanaka, Recent progress on fabrication of memristor and transistor-based neuromorphic devices for high signal processing speed with low power consumption, Jpn. J. Appl. Phys., 57 (2018), 03EA06. https://doi.org/10.7567/JJAP.57.03EA06 doi: 10.7567/JJAP.57.03EA06
[38]	S. S. Sarwar, S. A. N. Saqueb, F. Quaiyum, A. B. M. H. U. Rashid, Memristor-based nonvolatile random access memory: Hybrid architecture for low power compact memory design, IEEE Access, 1 (2013), 29–34. https://doi.org/10.1109/ACCESS.2013.2259891 doi: 10.1109/ACCESS.2013.2259891

Reader Comments

Your name:*

Email:*
© 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

AIMS Mathematics

1.8 3.4

Metrics

Article views(1206) PDF downloads(53) Cited by(0)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

AIMS Mathematics

Enhancing skeleton-based human motion recognition with Lie algebra and memristor-augmented LSTM and CNN

Related Papers:

Abstract

1. Introduction

2. Some properties of the alternative hyper-Poisson distribution

3. The AHP-INGARCH(p, q) model

3.1. AHP-INGARCH(1, 1) model

3.2. AHP-INARCH(1) model

4. Maximum likelihood estimation

5. Monte Carlo simulation study

6. Comparison of INGARCH models

6.1. Polio series

6.2. Internet Protocol (IP) count series

6.3. COVID-19 series

7. Conclusions

Use of AI tools declaration

Acknowledgements

Conflict of interest

Appendix A

A.1. Parameter constraint of AHP distribution

A.2. Proof of Theorem 3.1

A.3. Proof of Special Case AHP-INGARCH(1, 1)

References

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog

Abstract

1. Introduction

2. Some properties of the alternative hyper-Poisson distribution

3. The AHP-INGARCH(p, q) model

3.1. AHP-INGARCH(1, 1) model

3.2. AHP-INARCH(1) model

4. Maximum likelihood estimation

5. Monte Carlo simulation study

6. Comparison of INGARCH models

6.1. Polio series

6.2. Internet Protocol (IP) count series

6.3. COVID-19 series

7. Conclusions

Use of AI tools declaration

Acknowledgements

Conflict of interest

Appendix A

A.1. Parameter constraint of AHP distribution

A.2. Proof of Theorem 3.1

A.3. Proof of Special Case AHP-INGARCH(1, 1)

References