1.
Introduction
As the availability of high-frequency data becomes more widespread, it has become very popular to model random fluctuations of some econometric variables over time using Itô semi-martingale. Specifically, in financial mathematics, it has become very popular to model log asset prices or interest rates using the stochastic processes X=(Xt):
for t∈[0,1] [1]. An emblematic problem in econometrics is how to estimate the quadratic variation (the integrated volatility) of log prices, i.e., ⟨X,X⟩t=∫t0σ2sds.
A classical estimator of integrated volatility is the realized volatility c.f. [2], based on the discrete time observations
and the estimator is defined as [X,X]nt=∑ti≤t(ΔXti)2, where ΔXti=Xti−Xti−1 for i≥1. It is well known that [X,X]ntP→⟨X,X⟩t [3]. However, when it comes to the reality, observed high-frequency data often exhibit complex features and complicated structures due to those issues:
● Jumps;
● Market microstructure noise;
● Endogenous in the price sampling times.
For the first issue, two well-behaved estimators are the multiple-power estimator [4,5] and the realized threshold quadratic variation [6,7]. One commonly used assumption is that Xt is a jump-diffusion Itô process:
for t∈[0,T], where Xcand Xd are the continuous and jumps terms, whose forms are given in (2.1) and (2.2) later. Under this setting, the quadratic variation of X becomes
For the second issue, the model commonly used is the discretely observed process with micro-structure noise:
where {εti,i≥0} are i.i.d. random variables, satisfying E(εti)=0, E(ε2ti)=σ2, independent of the process Xct, and the sampling times {ti,i≥0} are independent of Xc. For estimating an univariate integrated volatility in the presence of microstructure noise, various estimators have been proposed by researchers, such as two-time scale realized volatility [8], multi-scale realized volatility [9], wavelet realized volatility [10], pre-averaging realized volatility [11], kernel realized volatility [12], and a quasi-maximum likelihood estimator [13]. For estimating a multivariate integrated co-volatility, various methods include a quasi-maximum likelihood estimator based on generalized sampling time [14], the pre-averaging realized volatility [15], realized kernel volatility estimator based on a refresh time scheme [16], and multi-scale realized co-volatility based on previous tick data synchronization [17]. For estimating large integrated volatility matrices, methods consist of universal thresholding [18,19,20,21], and adaptive thresholding [22].
For the last issue, the sampling times are irregular or random but (conditionally) independent of the price process. Volatility estimation in some special situations, and in a general situation have been studied [23,24,25]. A detailed discussion on the issue of possible endogenous effect has been provided in a semi-parametric context [26], and the time endogenous effect on volatility estimation has been investigated in a non-parametric setting [27]. When there were Xc, Xd and endogenous time, Li et al. [28] developed a procedure that yields a consistent estimator of the integrated volatility. When there were Xc, microstructure noise and endogenous time, Li, Zhang and Zheng [29] considered estimators of the volatility and their asymptotic properties. Li and Guo [30] proposed a new estimator of the integrated volatility in the presence of both market micro-structure noise and jumps when sampling times are endogenous, through averaging every p observations that precede each observation in the sub-sample S to remove the effect of ε, and the method of cutting off the "big" part to remove the effect of the jump part. They obtained only an asymptotic rate n1/6−δ for any δ>0 due to the local averaging of a single sub-grid being used to reduce the effect of microstructure noise.
We must point out the differences between this paper and [31], although the methods of the two articles seem to be similar. A nonparametric procedure, based on a combination of the preaveraging method and threshold technique, is proposed to estimate the integrated volatility of an Itô semi-martingale in the presence of jumps and microstructure noise. However, we propose a methodology that combines threshold and the multiple sub-grids, to estimate the quadratic variation of an Itô semi-martingale in the presence of endogenous time, jumps, and microstructure noise. First, the sub-sample is used to reduce the effect of the noise. Then, the threshold method is used to get rid of the effect of jumps. Finally, the multiple sub-grids method is used to increase the convergence rate. Thus, the circumstances of the model and the estimated methods are both different.
In this paper, we use the sub-sample to reduce the effect of the noise, while using the multiple sub-grids method to increase the convergence rate. Then, we use the threshold method to get rid of the effect of jumps. we attempt to develop an estimator that converges consistently to the integrated volatility in the presence of jumps, micro-structure noise and time endogenous in a general setting. The asymptotic normality of the proposed estimator is also established.
The remainder of the paper is organized as follows. Some assumptions made by the model and introduction to the methodology are discussed in Section 2. The consistency and asymptotic normality results are given in Section 3. In Section 4, simulation results are presented. Some discussions are given in Section 5 and all the technical proofs are given in the Appendix.
2.
Preliminaries
2.1. Model assumptions
Let X=(Xt) be the log price of a single asset for continuous time t≥0, which is defined on a stochastic basis (Ω,F,Ft,P). Then, the model (1.3) is called an Itô semi-martingale if it has the form
where b and σ are locally bounded optional processes, μ is a jump measure compensated by ν; ν(dt,dx) has the form dtFt(dx), where Ft(dx) is a transition measure from Ω(0)×R+ endowed with the predictable σ−field into R/0, We define β:=inf{s:∫|x|≤1|x|sFt(dx)<∞}, which is called the jump activity index in the literature. If 0≤β<1, we also say that Xd has finite variation.
Actually, instead of observing Xt, we observe Yt due to bid-ask spread bounces, differences in trade sizes, et al., where
where {εti,i≥0} are i.i.d. random variables, satisfying E(εti)=0, E(ε2ti)=σ2ε, and have common fourth moments.
Define the quadratic variation of X as
Here, we aim to develop a new estimator for (2.4) and to investigate some asymptotic properties of the proposed estimator in the presence of jumps, micros-structure noise, and time endogenous.
2.2. Methodology
To estimate the quadratic variation of (2.4), in this section, we give a new estimator ^⟨Xc,Xc⟩t. First, we need the notation of ¯Ytki,0 on the k-th sub-grid to reduce the effect of the noise. Then, we provide [¯Y,¯Y]Skt to get rid of the effect of jumps on the k-th grid. Finally, we use the moving average estimator ^⟨Xc,Xc⟩t based on the multiple sub-grids to obtain the optimal rate n1/4−δ. Now, let us describe the estimator in detail.
Denote Nt=max{i:ti≤t}, we assume that maxiΔtiP→0 is driven by some underlying force, for instance, n→∞, where n (non-random) measures the sampling frequency over the time interval [0, t]. In constructing the local average, we denote p as the number of observations, q as the size of blocks, and both are non-random numbers just as n. Define
which satisfies that lq≤n, and as p shall be taken as o(n), lq/n→1 as n→∞. Moreover, for k=0,1,⋯,q−1, we define
We consider the time endogeneity on the sub-grid level. The sub-sample S=Sk:={tp+k,tq+p+k,⋯,tiq+p+k,⋯} is constructed by choosing every q-th observation starting from the p+k-th observation from the complete grid. Then, we define
where tki,j=tiq+p−j+k and recall that tki,0=tiq+p+k denotes the i-th observation time on the k-th sub-grid.
To get rid of the effect of jumps on the k-th grid, the realized volatility of the locally averaged Y process is defined as
where Δ¯Ytki,0=¯Ytki,0−¯Ytki−1,0 for i≥1.
After correcting the bias due to noise, the threshold estimator ^⟨Xc,Xc⟩t of ⟨Xc,Xc⟩t is provided as following:
where Lt:=max{i:tki,0≤t}, ^σ2ε=12n∑ni=1(ΔiY)21{|ΔiY|≤ui} is an estimator of σ2ε, uki,0, satisfies
and ui is similar to uki,0.
3.
Results
In this section, the limiting behavior of the estimator will be established. To provide the asymptotic results on multiple sub-grids, the following assumptions are needed.
● (1) There is a filtration (Ft)t≥0 where (ti)i≥1 are (Ft)-stopping times. Furthermore, the filtration (Ft) is generated by finitely many continuous martingales.
● (2) Wt, bt and σ2t≥c>0 are adapted to a filtration (Ft), integrable and locally bounded, where c is non random;
● (3) Δn=max1≤i≤n|ti−ti−1|=Op(1/n1−η) for some nonnegative constant η;
● (4) Lt/lP→∫t0rsds in D[0, 1], where rs is an adapted integrable process;
● (5) the microstructure noise sequence (εti)i≥0 consists of independent random variables with mean 0, variance σ2ε, and common finite third and forth moments, and is independent of F1.
● (6) l∑ti≤t(∑q−1j=1q−jqΔYti−j1{|ΔYti−j|≤ui})2(ΔYti)21{|ΔiY|≤ui}P→∫t0wsσ4sds for every t∈[0,1], where wsσ4s is integrable, and ui satisfies (2.9);
● (7) 1q∑q−1k=0√l∑tki,0≤t(ΔˉYtki,0)31{|ΔˉYtki,0|≤uki,0}P→∫t0ˉvsσ3sds for every t∈[0,1], where ˉv2sσ4s is integrable, and uki,0 satisfies (2.9).
Remark 1. If times are exogenous, Condition (6) can be reduced to a similar assumption in [31]. However, when observation times can be endogenous, the limit is expected to be different.
Theorem 1. Under the models (2.1)–(2.3) and assumptions (1)–(5), suppose that η∈[0,1/9), and l∼Clnα and p∼Cpnα, for some max(4η,1/3)<α<(1−η)/2 and positive constants Cl and Cp, we have
Remark 2. In such circumstances, this result does not change the integrated variance of the limit process. The asymptotic mean-squared-error (MSE) is invariable, but must be decomposed differently (see Therem 2).
Proof Thanks to a standard localization procedure, we can use a bounded assumption to replace the local bounded in assumptions, while we also assume that the process Xt, itself, and thus the jump process Xdt, is bounded as well. That is, for all results which need the assumption about volatility and Lévy measure, we may assume further that
Recall that Yt=Xct+Xdt+εt=Zt+Xdt.
We can divide the equation into three parts,
where
(1) For ξ11, when |ΔˉZtki,0|≥uki,0/2, for an appropriate constant C, we have
when |ΔˉZtki,0|<uki,0/2, we have
where l and r are both any positive numbers which may change at different places. By the assumption of boundedness of the parameters, we repeatedly use Hölder's and Burkholder's inequalities, then
We deduce from above inequalities and estimations
Let m=r=1, we have that
By assumption of uki,0, we have 1q∑q−1k=0∑tki,0≤tE|ξ11|→0 uniformly.
(2) For ξ12, similar to ξ11, we have 1q∑q−1k=0∑tki,0≤tE|ξ12|→0 uniformly.
(3) For ξ13, the proof is similar to Theorem 1 of [30] or the result of Theorem 2 in [29], we have 1q∑q−1k=0∑tki,0≤tE|ξ13|→0 uniformly.
Combining (1), (2) and (3), we can finish the proof of the theorem. □
We will use the concept of stable convergence in the Central Limit Theorem below. A sequence of random variables (r.v.s) Xn converges stably in law to a r.v. X defined on the appropriate extension of the original probability space, if and only if for any set A∈F and real number x, we have
We shall write it as XnS→X. An immediate consequence is that for any F−measurable random variable σ, we have the joint weak convergence (Xn,σ)⇒(X,σ). Hence, it is slightly stronger than convergence in law.
Define
Theorem 2. Under the same assumptions in Theorem 1 and assumptions (6) and (7), then, we have
and stably in law,
where Bt is a standard Brownian motion that is independent of F1.
Remark 3. The limiting process of (3.18) depends on the underlying X, the reason is that endogeneity of sampling times is existent. The endogeneity induces a bias term which is nonzero if and only if the limit in 1q∑q−1k=0√l∑tki,0≤t(ΔˉYtki,0)31{|ΔˉYtki,0|≤uki,0} is no longer zero. The remaining term is the variance of a normal distribution.
Proof Since the jumps of Xt is a finite variation process when β<1, we have the following decomposition:
where b′1s=bs−∫|x|≤1sFs(dx), Z′t=X′t+εt and X″=∑s≤tΔXs.
Through the decomposition of (3.18), i.e.,
it suffices to show
and
Similar to Theorem 1, we have the following estimates:
By repeated use of Hölder's inequality and the inequality
we get
Let s→β, and for some large enough m and r, we have
● m(1/2−ϖ2)−1/2>0,
● r(1−ϖ2)−1/2>0,
● 2(1−β)ϖ1+1/2>0,
● (1−β)ϖ1>0,
because ϖ1>0 and β<1. Thus, (3.21) is proved.
Similar to (3.21), meanwhile, we combine A.3. in [29] and can get (3.22) and (3.23). □
4.
Simulation study
In this part, three sample sizes n=11,700, 23,400 and 46,800 within T=1 are considered, the log price is drawn from the Ornstein-Uhlenbeck process with drift added by a symmetric stable Lévy process, namely,
where Ws is a standard Brownian motion, and Xdt is a symmetric β−stable Lévy process.
There are several tuning parameters (n, l, p, q and β) in the proposed estimator that have to be determined. For the sampling frequency n, we use the average number of transactions per day for the past, say 30 days as an approximation. In Theorem 1, we notice that l∼Clnα and p∼Cpnα, so, for (l,p,q), we choose an appropriate p. Under the following simulation setting, the standard deviation of the noise is σε:=(σ2ε)1/2 = 0.0005. We choose p=5, which is found to be good enough to reduce the effect of the micro-structure noise. Since the block size q should be larger than p, it is chosen to be 20.
The procedure is repeated 1000 times, and the consistency and asymptotic normality of the estimator are examined. We can get the following observations from the simulation results and QQ-plot.
5.
Conclusions
In this work, based on high-frequency transaction data, we provide a new estimator for the quadratic variation, i.e., integrated volatility, of log prices, in the presence of the endogenous time, micro-structure noise, and jumps. First, we use the sub-sample method to reduce the effect of the noise. Second, we adopt the threshold method to get rid of the effect of jumps. Finally, the multiple sub-grids method is used to increase the rate of convergence. Both the consistency and asymptotic normality of the estimator are investigated. In Theorem 2, if one assumes that Δn=Op(1/n), then η=0, and the convergence rate can be arbitrarily closed to n1/4, which is recognized as the optimal convergence rate in the presence of micro-structure noise. However, with the advance of technology in high-frequency trading, it often involves dozens or even hundreds of assets in financial applications. The corresponding integrated volatility matrix is turned to a high-dimensional problem, which motivates us to develop a new estimator to solve these issues when the observed data have endogenous time, micro-structure noise, jumps, etc.
Use of AI tools declaration
The authors declare that they have not used Artificial Intelligence (AI) tools in the creation of this article.
Acknowledgments
This research was funded by Erlin Guo OF Jiangsu Province grant number BY2022768.
Conflict of interest
The authors declare no conflicts of interest.