Loading [MathJax]/jax/output/SVG/jax.js
Research article

Estimation of the quadratic variation of log prices based on the Itô semi-martingale

  • Received: 02 October 2023 Revised: 23 December 2023 Accepted: 03 January 2024 Published: 12 January 2024
  • As the availability of high-frequency data becomes more widespread, it has become very popular to model random fluctuations of some econometric variables over time using Itô semi-martingale. An emblematic problem is to estimate the quadratic variation, i.e., the integrated volatility of log prices, using noisy high frequency data with endogenous time and jumps. We propose a methodology that combines the multiple sub-grids and thresholds. First, the sub-sample is used to reduce the effect of the noise. Then, the threshold method is used to get rid of the effect of jumps. Finally, the multiple sub-grids method is used to increase the convergence rate. The asymptotic properties, such as consistency and asymptotic normality, are investigated. Simulation is also included to illustrate the performance of the proposed procedure.

    Citation: Erlin Guo, Patrick Ling. Estimation of the quadratic variation of log prices based on the Itô semi-martingale[J]. Electronic Research Archive, 2024, 32(2): 799-811. doi: 10.3934/era.2024038

    Related Papers:

    [1] Yang Song, Beiyan Yang, Jimin Wang . Stability analysis and security control of nonlinear singular semi-Markov jump systems. Electronic Research Archive, 2025, 33(1): 1-25. doi: 10.3934/era.2025001
    [2] Kwok-Pun Ho . Martingale transforms on Banach function spaces. Electronic Research Archive, 2022, 30(6): 2247-2262. doi: 10.3934/era.2022114
    [3] Li-Bin Liu, Ying Liang, Jian Zhang, Xiaobing Bao . A robust adaptive grid method for singularly perturbed Burger-Huxley equations. Electronic Research Archive, 2020, 28(4): 1439-1457. doi: 10.3934/era.2020076
    [4] Hankang Ji, Yuanyuan Li, Xueying Ding, Jianquan Lu . Stability analysis of Boolean networks with Markov jump disturbances and their application in apoptosis networks. Electronic Research Archive, 2022, 30(9): 3422-3434. doi: 10.3934/era.2022174
    [5] Ramalingam Sakthivel, Palanisamy Selvaraj, Oh-Min Kwon, Seong-Gon Choi, Rathinasamy Sakthivel . Robust memory control design for semi-Markovian jump systems with cyber attacks. Electronic Research Archive, 2023, 31(12): 7496-7510. doi: 10.3934/era.2023378
    [6] Xianfei Hui, Baiqing Sun, Indranil SenGupta, Yan Zhou, Hui Jiang . Stochastic volatility modeling of high-frequency CSI 300 index and dynamic jump prediction driven by machine learning. Electronic Research Archive, 2023, 31(3): 1365-1386. doi: 10.3934/era.2023070
    [7] Ping Yang, Xingyong Zhang . Existence of nontrivial solutions for a poly-Laplacian system involving concave-convex nonlinearities on locally finite graphs. Electronic Research Archive, 2023, 31(12): 7473-7495. doi: 10.3934/era.2023377
    [8] Zhiqing Li, Wenbin Zhang, Yuanfei Li . Structural stability for Forchheimer fluid in a semi-infinite pipe. Electronic Research Archive, 2023, 31(3): 1466-1484. doi: 10.3934/era.2023074
    [9] Yijun Chen, Yaning Xie . A kernel-free boundary integral method for reaction-diffusion equations. Electronic Research Archive, 2025, 33(2): 556-581. doi: 10.3934/era.2025026
    [10] Nishui Cai, Guofeng He . Multi-cloud resource scheduling intelligent system with endogenous security. Electronic Research Archive, 2024, 32(2): 1380-1405. doi: 10.3934/era.2024064
  • As the availability of high-frequency data becomes more widespread, it has become very popular to model random fluctuations of some econometric variables over time using Itô semi-martingale. An emblematic problem is to estimate the quadratic variation, i.e., the integrated volatility of log prices, using noisy high frequency data with endogenous time and jumps. We propose a methodology that combines the multiple sub-grids and thresholds. First, the sub-sample is used to reduce the effect of the noise. Then, the threshold method is used to get rid of the effect of jumps. Finally, the multiple sub-grids method is used to increase the convergence rate. The asymptotic properties, such as consistency and asymptotic normality, are investigated. Simulation is also included to illustrate the performance of the proposed procedure.



    As the availability of high-frequency data becomes more widespread, it has become very popular to model random fluctuations of some econometric variables over time using Itô semi-martingale. Specifically, in financial mathematics, it has become very popular to model log asset prices or interest rates using the stochastic processes X=(Xt):

    dXt=btdt+σtdWt, for t[0,1]. (1.1)

    for t[0,1] [1]. An emblematic problem in econometrics is how to estimate the quadratic variation (the integrated volatility) of log prices, i.e., X,Xt=t0σ2sds.

    A classical estimator of integrated volatility is the realized volatility c.f. [2], based on the discrete time observations

    Xti for 0=t0<t1<t2<tnT, (1.2)

    and the estimator is defined as [X,X]nt=tit(ΔXti)2, where ΔXti=XtiXti1 for i1. It is well known that [X,X]ntPX,Xt [3]. However, when it comes to the reality, observed high-frequency data often exhibit complex features and complicated structures due to those issues:

    ● Jumps;

    ● Market microstructure noise;

    ● Endogenous in the price sampling times.

    For the first issue, two well-behaved estimators are the multiple-power estimator [4,5] and the realized threshold quadratic variation [6,7]. One commonly used assumption is that Xt is a jump-diffusion Itô process:

    dXt=dXct+dXdt (1.3)

    for t[0,T], where Xcand Xd are the continuous and jumps terms, whose forms are given in (2.1) and (2.2) later. Under this setting, the quadratic variation of X becomes

    [X,X]t=[Xc,Xc]t+[Xd,Xd]t=t0σ2sds+0st(ΔXs)2. (1.4)

    For the second issue, the model commonly used is the discretely observed process with micro-structure noise:

    Yti=Xcti+εti,for i=0,1,,n, (1.5)

    where {εti,i0} are i.i.d. random variables, satisfying E(εti)=0, E(ε2ti)=σ2, independent of the process Xct, and the sampling times {ti,i0} are independent of Xc. For estimating an univariate integrated volatility in the presence of microstructure noise, various estimators have been proposed by researchers, such as two-time scale realized volatility [8], multi-scale realized volatility [9], wavelet realized volatility [10], pre-averaging realized volatility [11], kernel realized volatility [12], and a quasi-maximum likelihood estimator [13]. For estimating a multivariate integrated co-volatility, various methods include a quasi-maximum likelihood estimator based on generalized sampling time [14], the pre-averaging realized volatility [15], realized kernel volatility estimator based on a refresh time scheme [16], and multi-scale realized co-volatility based on previous tick data synchronization [17]. For estimating large integrated volatility matrices, methods consist of universal thresholding [18,19,20,21], and adaptive thresholding [22].

    For the last issue, the sampling times are irregular or random but (conditionally) independent of the price process. Volatility estimation in some special situations, and in a general situation have been studied [23,24,25]. A detailed discussion on the issue of possible endogenous effect has been provided in a semi-parametric context [26], and the time endogenous effect on volatility estimation has been investigated in a non-parametric setting [27]. When there were Xc, Xd and endogenous time, Li et al. [28] developed a procedure that yields a consistent estimator of the integrated volatility. When there were Xc, microstructure noise and endogenous time, Li, Zhang and Zheng [29] considered estimators of the volatility and their asymptotic properties. Li and Guo [30] proposed a new estimator of the integrated volatility in the presence of both market micro-structure noise and jumps when sampling times are endogenous, through averaging every p observations that precede each observation in the sub-sample S to remove the effect of ε, and the method of cutting off the "big" part to remove the effect of the jump part. They obtained only an asymptotic rate n1/6δ for any δ>0 due to the local averaging of a single sub-grid being used to reduce the effect of microstructure noise.

    We must point out the differences between this paper and [31], although the methods of the two articles seem to be similar. A nonparametric procedure, based on a combination of the preaveraging method and threshold technique, is proposed to estimate the integrated volatility of an Itô semi-martingale in the presence of jumps and microstructure noise. However, we propose a methodology that combines threshold and the multiple sub-grids, to estimate the quadratic variation of an Itô semi-martingale in the presence of endogenous time, jumps, and microstructure noise. First, the sub-sample is used to reduce the effect of the noise. Then, the threshold method is used to get rid of the effect of jumps. Finally, the multiple sub-grids method is used to increase the convergence rate. Thus, the circumstances of the model and the estimated methods are both different.

    In this paper, we use the sub-sample to reduce the effect of the noise, while using the multiple sub-grids method to increase the convergence rate. Then, we use the threshold method to get rid of the effect of jumps. we attempt to develop an estimator that converges consistently to the integrated volatility in the presence of jumps, micro-structure noise and time endogenous in a general setting. The asymptotic normality of the proposed estimator is also established.

    The remainder of the paper is organized as follows. Some assumptions made by the model and introduction to the methodology are discussed in Section 2. The consistency and asymptotic normality results are given in Section 3. In Section 4, simulation results are presented. Some discussions are given in Section 5 and all the technical proofs are given in the Appendix.

    Let X=(Xt) be the log price of a single asset for continuous time t0, which is defined on a stochastic basis (Ω,F,Ft,P). Then, the model (1.3) is called an Itô semi-martingale if it has the form

    dXct=btdt+σtdWt, (2.1)
    dXdt=|x|1x(μν)(dt,dx)+|x|>1xμ(dt,dx), (2.2)

    where b and σ are locally bounded optional processes, μ is a jump measure compensated by ν; ν(dt,dx) has the form dtFt(dx), where Ft(dx) is a transition measure from Ω(0)×R+ endowed with the predictable σfield into R/0, We define β:=inf{s:|x|1|x|sFt(dx)<}, which is called the jump activity index in the literature. If 0β<1, we also say that Xd has finite variation.

    Actually, instead of observing Xt, we observe Yt due to bid-ask spread bounces, differences in trade sizes, et al., where

    Yti=Xti+εti,for i=0,1,,n, (2.3)

    where {εti,i0} are i.i.d. random variables, satisfying E(εti)=0, E(ε2ti)=σ2ε, and have common fourth moments.

    Define the quadratic variation of X as

    Xc,Xct=t0σ2sds. (2.4)

    Here, we aim to develop a new estimator for (2.4) and to investigate some asymptotic properties of the proposed estimator in the presence of jumps, micros-structure noise, and time endogenous.

    To estimate the quadratic variation of (2.4), in this section, we give a new estimator ^Xc,Xct. First, we need the notation of ¯Ytki,0 on the k-th sub-grid to reduce the effect of the noise. Then, we provide [¯Y,¯Y]Skt to get rid of the effect of jumps on the k-th grid. Finally, we use the moving average estimator ^Xc,Xct based on the multiple sub-grids to obtain the optimal rate n1/4δ. Now, let us describe the estimator in detail.

    Denote Nt=max{i:tit}, we assume that maxiΔtiP0 is driven by some underlying force, for instance, n, where n (non-random) measures the sampling frequency over the time interval [0, t]. In constructing the local average, we denote p as the number of observations, q as the size of blocks, and both are non-random numbers just as n. Define

    l:=npq,

    which satisfies that lqn, and as p shall be taken as o(n), lq/n1 as n. Moreover, for k=0,1,,q1, we define

    tki,j:=tiq+pj+k, for i=0,1,, and j=1,2,,p1. (2.5)

    We consider the time endogeneity on the sub-grid level. The sub-sample S=Sk:={tp+k,tq+p+k,,tiq+p+k,} is constructed by choosing every q-th observation starting from the p+k-th observation from the complete grid. Then, we define

    ¯Ytki,0:=1pp1j=0Ytiq+pj+k, for i=0,1,2,, and k=0,1,,q1, (2.6)

    where tki,j=tiq+pj+k and recall that tki,0=tiq+p+k denotes the i-th observation time on the k-th sub-grid.

    To get rid of the effect of jumps on the k-th grid, the realized volatility of the locally averaged Y process is defined as

    [¯Y,¯Y]Skt:=tki,0t(Δ¯Ytki,0)21{|Δ¯Ytki,0|uki,0} (2.7)

    where Δ¯Ytki,0=¯Ytki,0¯Ytki1,0 for i1.

    After correcting the bias due to noise, the threshold estimator ^Xc,Xct of Xc,Xct is provided as following:

    ^<Xc,Xc>t=1qq1k=0[¯Y,¯Y]Skt2Ltp(^σ2ε)=1qq1k=0tki,0t(Δ¯Ytki,0)21{|Δ¯Ytki,0|uki,0}2Ltp12nni=1(ΔiY)21{|ΔiY|ui}, (2.8)

    where Lt:=max{i:tki,0t}, ^σ2ε=12nni=1(ΔiY)21{|ΔiY|ui} is an estimator of σ2ε, uki,0, satisfies

    uki,0/(Δki,0)ϖ10,uki,0/(Δki,0)ϖ2, for some 0ϖ1<ϖ2<14 (2.9)

    and ui is similar to uki,0.

    In this section, the limiting behavior of the estimator will be established. To provide the asymptotic results on multiple sub-grids, the following assumptions are needed.

    ● (1) There is a filtration (Ft)t0 where (ti)i1 are (Ft)-stopping times. Furthermore, the filtration (Ft) is generated by finitely many continuous martingales.

    ● (2) Wt, bt and σ2tc>0 are adapted to a filtration (Ft), integrable and locally bounded, where c is non random;

    ● (3) Δn=max1in|titi1|=Op(1/n1η) for some nonnegative constant η;

    ● (4) Lt/lPt0rsds in D[0, 1], where rs is an adapted integrable process;

    ● (5) the microstructure noise sequence (εti)i0 consists of independent random variables with mean 0, variance σ2ε, and common finite third and forth moments, and is independent of F1.

    ● (6) ltit(q1j=1qjqΔYtij1{|ΔYtij|ui})2(ΔYti)21{|ΔiY|ui}Pt0wsσ4sds for every t[0,1], where wsσ4s is integrable, and ui satisfies (2.9);

    ● (7) 1qq1k=0ltki,0t(ΔˉYtki,0)31{|ΔˉYtki,0|uki,0}Pt0ˉvsσ3sds for every t[0,1], where ˉv2sσ4s is integrable, and uki,0 satisfies (2.9).

    Remark 1. If times are exogenous, Condition (6) can be reduced to a similar assumption in [31]. However, when observation times can be endogenous, the limit is expected to be different.

    Theorem 1. Under the models (2.1)–(2.3) and assumptions (1)–(5), suppose that η[0,1/9), and lClnα and pCpnα, for some max(4η,1/3)<α<(1η)/2 and positive constants Cl and Cp, we have

    ^<Xc,Xc>tPt0σ2sds. (3.1)

    Remark 2. In such circumstances, this result does not change the integrated variance of the limit process. The asymptotic mean-squared-error (MSE) is invariable, but must be decomposed differently (see Therem 2).

    Proof Thanks to a standard localization procedure, we can use a bounded assumption to replace the local bounded in assumptions, while we also assume that the process Xt, itself, and thus the jump process Xdt, is bounded as well. That is, for all results which need the assumption about volatility and Lévy measure, we may assume further that

    max|bt|,|σt|,|Xt|C, for some constant C>0 almost surely. (3.2)

    Recall that Yt=Xct+Xdt+εt=Zt+Xdt.

    We can divide the equation into three parts,

    ^<Xc,Xc>tt0σ2sds=1qq1k=0tki,0t[(ΔˉZtki,0+ΔˉXdtki,0)21{|ΔˉYtki,0|uki,0}(ΔˉZtki,0)2]2Ltp12nni=1[(ΔiZ+ΔiXd)21{|ΔiY|ui}(ΔiZ)2]+1qq1k=0tki,0t(ΔˉZtki,0)22Ltp12nni=1(ΔiZ)2t0σ2sds=ξ11+ξ12+ξ13, (3.3)

    where

    ξ11=1qq1k=0tki,0t[(ΔˉZtki,0+ΔˉXdtki,0)21{|ΔˉYtki,0|uki,0}(ΔˉZtki,0)2], (3.4)
    ξ12=2Ltp12nni=1[(ΔiZ+ΔiXd)21{|ΔiY|ui}(ΔiZ)2], (3.5)
    ξ13=1qq1k=0tki,0t(ΔˉZtki,0)22Ltp12nni=1(ΔiZ)2t0σ2sds. (3.6)

    (1) For ξ11, when |ΔˉZtki,0|uki,0/2, for an appropriate constant C, we have

    ξ11C|ΔˉZtki,0|2+m/(uki,0)m, (3.7)

    when |ΔˉZtki,0|<uki,0/2, we have

    |ξ11|C|ΔˉZtki,0|2|ΔˉXdtki,0|r/(uki,0)r,if|ΔˉYtki,0|>uki,0 (3.8)
    |ξ11|C((|ΔˉXdtki,0|uki,0)2+|ΔˉZtki,0|(|ΔˉXdtki,0|uki,0)),if|ΔˉYtki,0|uki,0 (3.9)

    where l and r are both any positive numbers which may change at different places. By the assumption of boundedness of the parameters, we repeatedly use Hölder's and Burkholder's inequalities, then

    E(|ΔˉXdtki,0|2)CΔki,0, (3.10)
    E(|ΔˉZtki,0|m)Cm(Δki,0)m/2, for m>0 (3.11)
    E[(|ΔˉXdtki,0|uki,0)2]CΔki,0(uki,0)2βCsΔki,0(uki,0)2s, for 0<β<s<2. (3.12)

    We deduce from above inequalities and estimations

    E|ξ11|C(|ΔˉZtki,0|2+m(uli,0)m+E|ΔˉZtki,0|2E|ΔˉXdtki,0|r(uki,0)r+E(|ΔˉXdtki,0|uki,0)2+E|ΔˉZtki,0|E(|ΔˉXdtki,0|uki,0))C[(Δki,0)2+m2(uki,0)m+(Δki,0)2+r2(uki,0)r+Δki,0(uki,0)(2s)+Δki,0(uki,0)1s/2]. (3.13)

    Let m=r=1, we have that

    E|ξ11|CΔki,0[(Δki,0)12uki,0+(uki,0)(1s/2)]. (3.14)

    By assumption of uki,0, we have 1qq1k=0tki,0tE|ξ11|0 uniformly.

    (2) For ξ12, similar to ξ11, we have 1qq1k=0tki,0tE|ξ12|0 uniformly.

    (3) For ξ13, the proof is similar to Theorem 1 of [30] or the result of Theorem 2 in [29], we have 1qq1k=0tki,0tE|ξ13|0 uniformly.

    Combining (1), (2) and (3), we can finish the proof of the theorem.

    We will use the concept of stable convergence in the Central Limit Theorem below. A sequence of random variables (r.v.s) Xn converges stably in law to a r.v. X defined on the appropriate extension of the original probability space, if and only if for any set AF and real number x, we have

    limnP(Xnx,A)=P(Xx,A). (3.15)

    We shall write it as XnSX. An immediate consequence is that for any Fmeasurable random variable σ, we have the joint weak convergence (Xn,σ)(X,σ). Hence, it is slightly stronger than convergence in law.

    Define

    A(p,q):=2qp1j=1(p2j2jp). (3.16)

    Theorem 2. Under the same assumptions in Theorem 1 and assumptions (6) and (7), then, we have

    A(p,q)n4α2ClCp/3, (3.17)

    and stably in law,

    l1/2(1qq1k=0[¯Y,¯Y]Skt2Ntpq^σ2ε(1+A(p,q))t0σ2sds)23t0ˉvsσsdXcs+t0[(4ws49ˉv2s)σ4s+8C3lCprs(σ2ε)2]1/2dBs, (3.18)

    where Bt is a standard Brownian motion that is independent of F1.

    Remark 3. The limiting process of (3.18) depends on the underlying X, the reason is that endogeneity of sampling times is existent. The endogeneity induces a bias term which is nonzero if and only if the limit in 1qq1k=0ltki,0t(ΔˉYtki,0)31{|ΔˉYtki,0|uki,0} is no longer zero. The remaining term is the variance of a normal distribution.

    Proof Since the jumps of Xt is a finite variation process when β<1, we have the following decomposition:

    Xt=X0+t0b1sds+t0σsdWs,Xt=XtXt (3.19)

    where b1s=bs|x|1sFs(dx), Zt=Xt+εt and X=stΔXs.

    Through the decomposition of (3.18), i.e.,

    l1/2(1qq1k=0[ˉY,ˉY]Skt2Ntpq^σ2ε(1+A(p,q))<Xc,Xc>t)=l1/21qq1k=0tki,0t[(ΔˉYtki,0)21{|ΔˉYtki,0|uki,0}(ΔˉZtki,0)2]l1/22Ntpq12NtNti=1[(ΔiY)21{|ΔiY|ui}(ΔiZ)2]+l1/2(1qq1k=0tki,0(ΔˉZtki,0)22Ntpq12NtNti=1(ΔiZ)2(1+A(p,q))<Xc,Xc>t), (3.20)

    it suffices to show

    l1/21qq1k=0tki,0t|(ΔˉYtki,0)21{|ΔˉYtki,0|uki,0}(ΔˉZtki,0)2|P0, (3.21)
    l1/22Ntpq12NtNti=1|(ΔiY)21{|ΔiY|ui}(ΔiZ)2|P0, (3.22)

    and

    l1/2(1qq1k=0tki,0t(ΔˉZtki,0)22Ntpq12NtNti=1(ΔiZ)2(1+A(p,q))<Xc,Xc>t)23t0ˉvsσsdXcs+t0[(4ws49ˉv2s)σ4s+8C3lCprs(σ2ε)2]1/2dBs. (3.23)

    Similar to Theorem 1, we have the following estimates:

    E(|ΔˉXtki,0|)CΔki,0, (3.24)
    E(|ΔˉZtki,0|m)C(Δki,0)m/2, for m>0 (3.25)
    E[|ΔˉXtki,0|uki,0]C(Δki,0)(uki,0)1s, for β<s<1. (3.26)

    By repeated use of Hölder's inequality and the inequality

    (|x|uki,0)2(uki,0)2m(|x|uki,0)m, for 0<m2, (3.27)

    we get

    l1/2E|(ΔˉYtki,0)21{ΔˉYtki,0uki,0}(ΔˉZtki,0)2|Cl1/2[(Δki,0)2+m2(uki,0)m+(Δki,0)r+1(uki,0)r+(Δki,0)2(uki,0)22s+(Δki,0)3/2(uki,0)1s]C(lΔki,0)1/2Δki,0[(Δki,0)m12mϖ2+(Δki,0)r12rϖ2+(Δki,0)2(1s)ϖ1+1/2+(Δki,0)(1s)ϖ1]. (3.28)

    Let sβ, and for some large enough m and r, we have

    m(1/2ϖ2)1/2>0,

    r(1ϖ2)1/2>0,

    2(1β)ϖ1+1/2>0,

    (1β)ϖ1>0,

    because ϖ1>0 and β<1. Thus, (3.21) is proved.

    Similar to (3.21), meanwhile, we combine A.3. in [29] and can get (3.22) and (3.23).

    In this part, three sample sizes n=11,700, 23,400 and 46,800 within T=1 are considered, the log price is drawn from the Ornstein-Uhlenbeck process with drift added by a symmetric stable Lévy process, namely,

    Xt=t0cos(s)ds+t0e2(ts)dWs+Xdt (4.1)

    where Ws is a standard Brownian motion, and Xdt is a symmetric βstable Lévy process.

    There are several tuning parameters (n, l, p, q and β) in the proposed estimator that have to be determined. For the sampling frequency n, we use the average number of transactions per day for the past, say 30 days as an approximation. In Theorem 1, we notice that lClnα and pCpnα, so, for (l,p,q), we choose an appropriate p. Under the following simulation setting, the standard deviation of the noise is σε:=(σ2ε)1/2 = 0.0005. We choose p=5, which is found to be good enough to reduce the effect of the micro-structure noise. Since the block size q should be larger than p, it is chosen to be 20.

    The procedure is repeated 1000 times, and the consistency and asymptotic normality of the estimator are examined. We can get the following observations from the simulation results and QQ-plot.

    Table 1.  Simulation results for β=0.25 and β=0.5 on three samples.
    β=0.25 β=0.5
    n (relative bias, s.e., mse) (relative bias, s.e., mse)
    11700 (−0.0766, 0.0530, 0.0087) (−0.0451, 0.0641, 0.0061)
    23400 (−0.0771, 0.0404, 0.0076) (−0.0511, 0.0493, 0.0050)
    46800 (−0.0764, 0.0328, 0.0069) (−0.0514, 0.0414, 0.0044)

     | Show Table
    DownLoad: CSV
    Figure 1.  QQ-plot for n=23400 and β=0.5.

    In this work, based on high-frequency transaction data, we provide a new estimator for the quadratic variation, i.e., integrated volatility, of log prices, in the presence of the endogenous time, micro-structure noise, and jumps. First, we use the sub-sample method to reduce the effect of the noise. Second, we adopt the threshold method to get rid of the effect of jumps. Finally, the multiple sub-grids method is used to increase the rate of convergence. Both the consistency and asymptotic normality of the estimator are investigated. In Theorem 2, if one assumes that Δn=Op(1/n), then η=0, and the convergence rate can be arbitrarily closed to n1/4, which is recognized as the optimal convergence rate in the presence of micro-structure noise. However, with the advance of technology in high-frequency trading, it often involves dozens or even hundreds of assets in financial applications. The corresponding integrated volatility matrix is turned to a high-dimensional problem, which motivates us to develop a new estimator to solve these issues when the observed data have endogenous time, micro-structure noise, jumps, etc.

    The authors declare that they have not used Artificial Intelligence (AI) tools in the creation of this article.

    This research was funded by Erlin Guo OF Jiangsu Province grant number BY2022768.

    The authors declare no conflicts of interest.



    [1] J. Q. Fan, A selective overview of nonparametric methods in financial econometrics (with discussion), Stat. Sci., 20 (2005), 317–357.
    [2] T. Andersen, T. Bollerslev, F. Diebold, P. Labys, Modeling and forcasting realized volatility, Econometrica, 71 (2003), 579–625. https://doi.org/10.1111/1468-0262.00418 doi: 10.1111/1468-0262.00418
    [3] P. Protter, Stochastic Integration and Differential Equations, Springer-Verlag, 2004. https://doi.org/10.2307/978-3-540-00313-4
    [4] O. E. Barndorff-Nielsen, N. Shephard, Power and bipower variation with stochastic volatility and jumps, J. Financ. Econ., 2 (2004), 1–37. https://doi.org/10.1093/jjfinec/nbh001 doi: 10.1093/jjfinec/nbh001
    [5] O. E. Barndorff-Nielsen, N. Shephard, Ecnometrics of testing for jumps in financial economics using bipower variation, J. Financ. Econ., 4 (2006), 1–30. https://doi.org/10.1093/jjfinec/nbi022 doi: 10.1093/jjfinec/nbi022
    [6] J. Jacod, Asymptotic properties of realized power variations and related functionals of semimartingales, Stoch. Proc. Appl., 118 (2008), 517–559. https://doi.org/10.1016/J.SPA.2007.05.005 doi: 10.1016/J.SPA.2007.05.005
    [7] C. Mancini, Non-parametric threshold estimation for models with stochastic diffusion coefficients and jumps, Scand. J. Stat., 36 (2009), 270–296. https://doi.org/10.1111/j.1467-9469.2008.00622.x doi: 10.1111/j.1467-9469.2008.00622.x
    [8] L. Zhang, P. A. Mykland, Y. Aït-Sahalia, A tale of two time scales: Determining integrated volatility with noisy high-frequency data, J. Am. Stat. Assoc., 100 (2005), 1394–1411. https://doi.org/10.2307/27590680 doi: 10.2307/27590680
    [9] L. Zhang, Efficient estimation of stochastic volatility using noisy observations: A multi-scale approach, Bernoulli, 12 (2006), 1019–1043. https://doi.org/10.2139/ssrn.619682 doi: 10.2139/ssrn.619682
    [10] J. Q. Fan, Y. Z. Wang, Multi-scale jump and volatility analysis for high-frequency financial data, J. Am. Statist. Assoc., 102 (2007), 1349–1362. https://doi.org/10.1198/016214507000001067 doi: 10.1198/016214507000001067
    [11] J. Jacod, Y. Li, P. A. Mykland, M. Podolskij, M. Vetter, Microstructure noise in the continuous case: The pre-averaging approach, Stoch. Proc. Appl., 179 (2009), 2249–2276. https://doi.org/10.1016/j.spa.2008.11.0047 doi: 10.1016/j.spa.2008.11.0047
    [12] O. E. Barndorff-Nielsen, P. R. Hansen, A. Lunde, N. Shephard, Designing realized kernels to measure the ex-post variation of equity prices in the presence of noise, Econometrica., 76 (2008), 1481–1536. https://doi.org/10.3982/ecta6495 doi: 10.3982/ecta6495
    [13] D. Xiu, Quasi-maximum likelihood estimation of volatility with high frequency data, J. Econometrics., 159 (2010), 235–250. https://doi.org/10.1016/j.jeconom.2010.07.002 doi: 10.1016/j.jeconom.2010.07.002
    [14] Y. Aït-Sahalia, J. Q. Fan, D. Xiu, High-frequency covariance estimates with noisy and asynchronous financial data, J. Amer. Statist. Assoc., 105 (2010), 1504–1517. https://doi.org/10.2139/ssrn.1631344 doi: 10.2139/ssrn.1631344
    [15] K. Christensen, S. Kinnebrock, M. Podolskij, Pre-averaging estimators of the ex-post covariance matrix in noisy diffusion models with non-synchronous data, J. Econometrics, 159 (2010), 116–133. https://doi.org/10.1016/j.jeconom.2010.05.001 doi: 10.1016/j.jeconom.2010.05.001
    [16] O. E. Barndorff-Nielsen, P. R. Hansen, A. Lunde, N. Shephard, Multivariate realized kernels: Consistent positive semi-definite estimators of the covariation of equity prices with noise and non-synchronous trading, J. Econometrics., 162 (2011), 149–169. https://doi.org/10.1016/j.jeconom.2010.07.009 doi: 10.1016/j.jeconom.2010.07.009
    [17] L. Zhang, Estimating covariation: epps effect, microstructure noise, J. Econometrics., 160 (2011), 33–47. https://doi.org/10.1016/j.jeconom.2010.03.012 doi: 10.1016/j.jeconom.2010.03.012
    [18] Y. Z. Wang, J. Zou, Vast volatility matrix estimation for high-frequency financial data, Ann. Statist., 38 (2010), 943–978. https://doi.org/10.1214/09-aos730 doi: 10.1214/09-aos730
    [19] M. Tao, Y. Z. Wang, X. Chen, Fast convergence rates in estimating large volatility matrices using high-frequency financial data, Economet. Theor., 29 (2013), 11–19. https://doi.org/10.2139/ssrn.3786912 doi: 10.2139/ssrn.3786912
    [20] M. Tao, Y. Z. Wang, H. Zhou, Fast convergence rates in estimating large volatility matrices using high-frequency financial data, Ann. Statist., 41 (2013), 1816–1864. https://doi.org/10.1214/13-aos1128 doi: 10.1214/13-aos1128
    [21] D. Kim, Y. Z. Wang, J. Zou, Asymptotic theory for large volatility matrix estimation based on high-frequency financial data, Stoch. Proc. Appl., 126 (2016), 3527–3577. https://doi.org/10.1016/j.spa.2016.05.004 doi: 10.1016/j.spa.2016.05.004
    [22] D. Kim, X. B. Kong, C. X. Li, Y. Z. Wang, Adaptive thresholding for large volatility matrix estimation based on high-frequency financial data, J. Econometrics, 203 (2018), 69–79. https://doi.org/10.1016/J.JECONOM.2017.09.006 doi: 10.1016/J.JECONOM.2017.09.006
    [23] M. Fukasawa, Central limit theorem for the realized volatility based on tick time sampling, Financ. Stoch., 14 (2010), 209–233. https://doi.org/10.1007/s00780-008-0087-3 doi: 10.1007/s00780-008-0087-3
    [24] M. Fukasawa, M. Rosenbaum, Central limit theorems for realized volatility under hitting times of an irregular grid, Stoch. Proc. Appl., 122 (2012), 3901–3920. https://doi.org/10.1016/j.spa.2012.08.005 doi: 10.1016/j.spa.2012.08.005
    [25] M. Fukasawa, Realized volatility with stochastic sampling, Stoch. Proc. Appl., 120 (2010), 829–852. https://doi.org/10.1016/j.spa.2010.02.006 doi: 10.1016/j.spa.2010.02.006
    [26] E. Renault, B. J. Werker, Causality effects in return volatility measures with random times, J. Econometrics., 160 (2011), 272–279. https://doi.org/10.1016/j.jeconom.2010.03.036 doi: 10.1016/j.jeconom.2010.03.036
    [27] Y. Li, E. Renault, P. A. Mykland, L. Zhang, X. Zheng, Realized volatility when sampling times are possibly endogenous, Economet. Theor., 30 (2014), 580–605. https://doi.org/10.1017/s0266466613000418 doi: 10.1017/s0266466613000418
    [28] C. X. Li, J. Y. Chen, Z. Liu, B. Y. Jing, On integrated volatility of Ito semimartingales when sampling times are endogenous, Commun. Stat-Theor. M., 43 (2014), 5263–5275. https://doi.org/10.1080/03610926.2012.730169 doi: 10.1080/03610926.2012.730169
    [29] Y. Li, Z. Zhang, X. Zheng, Volatility inference in the presence of both endogenous time and microstructure noise, Stoch. Proc. Appl., 123 (2013), 2696–2727. https://doi.org/10.1016/j.spa.2013.04.002 doi: 10.1016/j.spa.2013.04.002
    [30] C. X. Li, E. L. Guo, Estimation of the integrated volatility using noisy high-frequency data with jumps and endogeneity, Commun. Stat-Theor. M., 47 (2018), 521–531. https://doi.org/10.1080/03610926.2017.1307403 doi: 10.1080/03610926.2017.1307403
    [31] B. Y. Jing, Z. Liu, X. B. Kong, On the estimation of integrated volatility with jumps and microstructure noise, J. Bus. Econ. Stat., 32 (2014), 457–467. http://dx.doi.org/10.1080/07350015.2014.906350 doi: 10.1080/07350015.2014.906350
  • Reader Comments
  • © 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(984) PDF downloads(44) Cited by(0)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog