1.
Introduction
Bitcoin is a digital currency first proposed by Nakamoto (2008) that is synonymous with cryptocurrencies, and it has by far the largest share of the market capitalization.* Bitcoin can be anonymously exchanged between users via peer-to-peer transactions, which are all recorded in a distributed digital ledger called a blockchain. As a result, Bitcoin is fully decentralized, and it is not controlled by any national governments, central banks, or other authorities. Since Bitcoin was released as open-source software and became operational in 2009, cryptocurrencies have drawn much attention for various reasons ranging from technological interest to financial speculation.
*Approximately 1.3 trillion US dollars at the time of writing.
Bitcoin has been officially recognized as a currency in some regions. For example, Bitcoin was adopted as a fiat currency by the Republic of El Salvador in 2021 and the Central African Republic in 2022. Argentina designated Bitcoin as a currency that can be used for contractual agreements in 2023. In these cases, Bitcoin was adopted as a relatively safe currency by countries whose own fiat currency was hamstrung by hyperinflation, capital controls, or currency sanctions.
However, the anonymity and decentralized nature of Bitcoin has also led to it being used to purchase illegal drugs and for online gambling. For example, Christin (2013) estimated that Silk Road (i.e., a black-market site for illicit drug sales) once accounted for 4.5%–9% of Bitcoin circulation. It has also been pointed out that cryptocurrencies are at risk of being used for money laundering. To combat this problem, legitimate exchange services for trading cryptocurrencies and fiat currencies have enforced increasingly strict identity verification measures.
The increasing importance of cryptocurrencies has led to more research on them, among which price prediction has been one of the most active areas of research. Various methods have been employed for price prediction including machine learning (Chen, 2023; Derbentsev et al., 2020; Iqbal et al., 2021; McNallly et al., 2018; Ranjan et al., 2023), deep learning (Bouteska et al., 2024; Kim et al., 2022; Oyedele et al., 2023; Politis et al., 2021; Tanwar et al., 2021), statistical models (Georgoula et al., 2015; Maleki et al., 2023; Malladi and Dheeriya, 2021; Poongodi et al., 2020; Shah and Zhang, 2014), sentiment analysis using textual information from social networking sites, and index-based forecasting such as the Bitcoin Misery Index (Otabek and Choi, 2022; Hajek et al., 2023; Inuduka et al., 2024; Moustafa et al., 2022; Washington et al., 2023). In addition, systems for the automatic trading of cryptocurrencies based on predicted prices have been studied (Cohen, 2023; Jing and Kang, 2024; Khurana et al., 2023; Madan, 2014; Tran, 2023).
To the best of our knowledge, however, no studies have yet focused on the trading interval of cryptocurrencies. Although some studies have considered the volatility of cryptocurrencies, research on their trading intervals is still in its infancy compared to other financial assets. A negative relationship has been identified between the trading interval and return volatility in financial markets (Nakakita and Nakatsuma, 2021; Toyabe and Nakatsuma, 2022). Nakakita and Nakatsuma (2021) focused on the stock market and found that the intraday volatility of stock indices increased just after the market opened and just before it closed showing a U-shaped seasonality when plotted on a graph. In contrast, Toyabe and Nakatsuma (2022) analyzed the trading interval of individual Japanese stocks and found that the trading interval was shorter just after the market opened and just before it closed showing an inverse U-shaped seasonality.
These results indicate that previous studies on the volatility of cryptocurrencies may help with gaining insights into their trading interval. Chu et al. (2017) fitted various models to the daily price movements of cryptocurrencies and noted that they demonstrated very high volatility. Umar and Gubareva (2020) showed that COVID-19 had similar impacts on the market volatilities of cryptocurrencies and fiat currencies. Wang et al. (2023) used machine learning methods to predict the volatility of cryptocurrencies and showed that internal determinants (e.g., lagged volatility, previous trading information) were more important than external determinants (e.g., technology, financial, and policy uncertainty factors). These studies indicate that the volatility of cryptocurrencies has similar characteristics to that of financial instruments such as fiat currencies and equities, although there are some differences.
Research on the trading intervals of conventional financial instruments such as stocks and bonds has flourished since the late 1990s with the advent of the autoregressive conditional duration (ACD) model by Engle and Russell (1999). Prior to then, analyses of daily and monthly data were the mainstream, so the autoregressive conditional heteroskedasticity (ARCH) model (Engle, 1982) and generalized ARCH (GARCH) model (Bollerslev, 1986) assumed that trading intervals had a constant duration. However, as data storage technology advanced and data granularity reached timescales of minutes, seconds, and ticks, statistical models became necessary to overcome this traditional assumption. The ACD model was developed to handle data with nonuniform time intervals. Bauwens and Veredas (2004) proposed the stochastic conditional duration (SCD) model as an extension of the ACD model that can stochastically vary the duration and volatility. Strickland et al. (2006) further extended the SCD model to a Bayesian statistical framework to utilize the flexibility and estimation stability of Bayesian statistics, and this model has since been employed to analyze the durations of trading intervals in financial markets from both a frequentist and Bayesian perspective (Men et al., 2015; Thekke et al., 2016; Toyabe et al., 2024; Xu et al., 2010).
Based on the above literature, we analyzed the trading interval duration of the Bitcoin market. As there is no litereture on the trading interval duration of cryptocurrencies to the best of our knowledge, this research should be considered significant as it is pioneering in this area. Knowing the trading interval duration of financial instruments is helpful for understanding the number and attributes of market participants, which is useful not only for traders and other business practitioners but also for supervisory authorities to set up market regulations. For instance, understanding the patterns of trade interaval durations and the factors that influence them can help investors construct less risky portfolios and market managers operate their systems more robustly. We referred to previous studies (Nakakita and Nakatsuma, 2021; Toyabe et al., 2024) to develop a model that uses Bernstein polynomials to estimate intraday seasonality and control for trends that happen to occur on the day, which we hoped would accurately capture the relationship between each explanatory variable and the trading interval duration. This paper is expected to facilitate the expansion of research on trading interval durations to cryptocurrencies.
The rest of this paper is organized as follows. Section 2 describes the data used in this study. Section 3 presents the developed statistical model and the posterior distributions of the model parameters estimated by the Markov chain Monte Carlo (MCMC) method. Section 4 discusses the estimation results. Section 6 presents the conclusions.
2.
Data
Tick data of spot trading between Bitcoin and US dollars were collected from the Binance website (Binance, 2024). At the time of writing, Binance is the world's largest Bitcoin exchange, and it has reported that the cumulative total of cryptocurrency trades reached 100 trillion dollars and the number of registered users reached 250 million (Binance, 2025). Here, spot trading was taken to refer to transactions in which financial instruments are exchanged at the current market rate. The data contained a history of all executed transactions, which could be used to examine the trading interval durations for Bitcoin. Transaction data for March and April 2024 were selected for this study because the data were the most recently available at the time of writing. By using data for two months, we aimed to ensure the robustness of our analysis. Data were only extracted each day for the 6.5-h period between 9:30 am and 4:00 pm. This is because March and April 2024 had more than 100,000,000 transactions in total, so the amount of data had to be limited to reduce computational costs. The 6.5-h period was selected to correspond with the trading hours of the US stock market. In addition, cryptocurrency exchanges are open 24 hours a day, 365 days a year, so traders can always trade on these markets. In contrast, stock markets are closed on weekends and over the New Year, etc. In this paper, we filtered the data according to trading hours, but we included weekends in the analysis. One of the aims of this paper is to examine whether there are differences between the market on weekdays and the market on weekends. As a result of data extraction, the number of executed trades used in the analysis came to 10,672,507 in March and 7,575,917 in April.
To facilitate analysis, the data were transformed by setting t as the time of day, where 9:30 corresponded to t=0 and 16:00 corresponded to t=1. Next, τi,d was taken as the i-th trading time of day d and Ti,d was taken as the i-th trading interval duration of day d. Then, Ti,d can be expressed as Ti,d=τi+1,d−τi,d, which obtained the explained variable yi,d=Ti,d/(24×60×60). Because the raw data were provided at a resolution of milliseconds, the trading interval duration could be analyzed with high accuracy. We created explanatory variables by processing information on the trade price, trade volume, and whether or not the maker is a buyer.† In the data, trade prices were available in cents, and trade volumes were available to the fifth decimal place. For this study, five explanatory variables were selected: the change in trade price, absolute value of the change in trade price, trade volume, change in trade price × trade volume, and absolute value of the change in trade price × trade volume.
†Trading entities consist of makers and takers. Makers are entities that place orders at prices that are not on the order book and make up the liquidity of the order book. Takers place orders in response to orders on the order book and take liquidity from the order book because they are executed immediately.
Let pi,d and vi,d be the i-th trade price and trade volume, respectively, on day d. Then, the change in the i-th trade price can be expressed as pi+1,d−pi,d, and the i-th trade volume can be expressed as vi+1,d. The descriptive statistics for the explained and explanatory variables are summarized in Tables 1 and 2. If the Bitcoin market is assumed to have the same characteristics as other financial markets in terms of changes in trade prices and volumes, then the regression coefficients for each explanatory variable can be expected as follows:
Log return: x1
Financial markets are known to have a leverage effect, which can be defined as a negative relationship between the price change and volatility of financial instruments (Bollerslev et al., 2006; Omori et al., 2007; Nakakita and Nakatsuma, 2021). As we noted in Section 1, a negative relationship has also been observed between the volatility and trading interval duration. Therefore, the regression coefficient is expected to be positive.
Price increase dummy: x2
There are three patterns for new transactions: price increase, no change, and price decrease. Since a price increase is thought to indicate that the market is fluctuating favorably, the regression coefficient will be negative.
Price decrease dummy: x3
The price decrease may indicate that the market is fluctuating negatively. Although there are differences between strong and weak markets, both markets are active, so it can be hypothesized that the regression coefficient will be negative.
Trade volume: x4
The trade volume is considered to indicate the activity of a market. Because a large trade volume indicates an active market, it should correspond to a shorter trading interval duration, so the coefficient for this explanatory variable should be negative.
Price increase dummy × Trade volume: x5
By multiplying the price increase dummy by the trade volume, we can see if the trading interval duration is affected not only by the direction of the price movement but also by its magnitude. We hypothesized that this explanatory variable has a positive coefficient due to the leverage effect.
Price increase dummy × Trade volume: x6
While x5,i considers the effect when the price increases, this examines the effect of the trade volume when the price decreases. As with the previous variable, it is expected that the regression coefficient will be negative due to the leverage effect.
Absolute log return × Trade volume: x7
This explanatory variable represents the monetary return of financial assets moved in a trade. Because the movement of large monetary value is considered indicative of the magnitude of movement in the market, the coefficient should be negative.
In addition to these five explanatory variables, we also added the term for intraday seasonality to control for trends that occurred during the day. A detailed description of intraday seasonality is given in Section 3.
3.
Model
3.1. Proposed model
In this section, we describe our proposed model for trading interval durations in the Bitcoin market and derive the posterior distributions of the model parameters for estimation by the MCMC method. Our proposed model is based on the SCD model:
where both εi and ηi are mutually and serially independent. The SCD model is a nonlinear non-Gaussian state-space model where Equation (1) is the observation equation and Equation (2) is the state equation. Note that all parameters should have the subscript d for the date, but they are omitted here for the sake of readability. xi is a vector of the explanatory variables, and β is a vector of the corresponding regression coefficients. For xi, in addition to the five variables described in Section 2, a term for the intraday seasonality is included in the form of a Bernstein polynomial to control for trends that occurred during the day. The n-th-order Bernstein basis function is expressed as
Linear combination of these basis functions results in the Bernstein polynomial:
The Bernstein polynomial has been proven to approximate any continuous function as n is increased to infinity (Bernstein, 1912). Summing up for xi obtains xi=[bi1 bi2 bi3 bi4 bi5 bi6 bi7 xIS′i]′. Then, we can estimate the regression coefficients of the explanatory variables, intraday seasonality, and other parameters simultaneously.
We performed the estimation separately for each day from April 1 to April 30, 2024, to test whether the Bitcoin market differed in characteristics depending on the day. We used the deviance information criterion (DIC) proposed by Spiegelhalter et al. (2002) to select the appropriate order of the Bernstein polynomial for each day.
For εi in Equation (1), we set the Weibull distribution expressed by the probability density function (PDF) as follows:
The PDF of yi is given by
Because Equation (2) is a stationary first-order autoregressive model (AR(1) model), the joint probability distribution of α=[α1…αn]′ is given by
V is a tridiagonal matrix, and it is positive definite so long as |ϕ|<1 holds. Given the above settings, the prior distributions of the parameters (β,γ,ϕ,σ2) are given by
where Gamma(aγ,bγ) denotes a gamma distribution with the shape parameter aγ and rate parameter bγ, Beta(aϕ,bϕ) denotes a beta distribution with the shape parameters aϕ and bϕ, and Inv. Gamma(aσ,bσ) denotes an inverse gamma distribution with the shape parameter aσ and scale parameter bσ.
Then, the joint posterior distribution of (α,β,γ,ϕ,σ2) for Equations (1) and (2) is given by
where p(yi|αi,β,γ) is the PDF of the Weibull distribution in Equation (5). Because the joint posterior distribution in Equation (10) cannot be estimated analytically, we employed the MCMC method to estimate the distribution numerically.
For simplicity of notation, let θ=(β,γ,ϕ,σ2). Then, we need to derive the conditional posterior distributions of the parameters in order to perform the MCMC method. The derivation of the conditional posterior distributions is complex and requires various mathematical techniques. Hence, the details are described in the appendix.
4.
Empirical analysis and results
We conducted an empirical analysis where we established four candidates for the dimensional order of the Bernstein polynomial representing the intraday seasonality: 3, 7, 11, and 15. The prior distribution was set as follows:
where I is the identity matrix.
The number of iterations for MCMC sampling was set to 1000 for the sampling period and 100 for the burn-in period. We utilized Geweke's convergence diagnostic to confirm convergence of the samples, and we extended the burn-in period if Geweke's convergence diagnostic did not pass at the 1% level for all parameters. For example, if the number of iterations was extended by 3000 and convergence was achieved after 4500 iterations in total, the first 3500 iterations were used as the burn-in period and the last 1000 iterations were used as the sampling period. With this procedure, when convergence was achieved for all parameters, the iterations were completed, and the last 1000 samples were adopted for estimation of the posterior distribution.
Tables 5 and 6 summarize the posterior mean and standard deviation of the regression coefficients and parameters (γ,ϕ,σ) for the trade price and trade volume. The estimation results for each date describe the results of the best model selected by DIC. Additionally, Figures 1 and 2 show the seasonality of trading interval durations estimated by the Bernstein polynomial. Since the regression coefficients of the Bernstein polynomial themselves do not have implications to be interpreted, the posterior statistics are presented as figures rather than tables of posterior statistics. The estimation results for each date describe the results of the best model selected by DIC. The DIC values for each model are summarized in Tables 5 and 6.
Then, let us discuss the posterior distribution in Tables 5 to 6.
Log return: β1
In both March and April, the regression coefficients were not significant on all days. The results indicate that a short-term impact between trading intervals and returns was not observed. This is consistent with the results of Toyabe et al. (2024), in which the relationship between trading intervals and returns was examined in the stock market. In this respect, it can be said that the Bitcoin market has similar characteristics to conventional financial assets such as stocks. In addition, this result suggest that the relationship between trading intervals and volatility is not completely one-sided.
Price increase dummy: β2
The posterior means were all positive for both March and April, and the results were statistically significant except for March 20th. The results indicate that when the trading price increases, the trading interval duration becomes longer than when the trading price does not change. Intuitively, we would expect the trading interval duration to be shorter when the price is increasing, as the market is considered to be more active. However, in reality, the trading interval duration was shorter when the price did not change.
Price decrease dummy: β3
The posterior means were statistically significantly positive with the exception of March 20th. While β2 is the effect of price increases, β3 is the effect of price decreases on trade intervals. Comparing the sizes of the posterior means, we found that there was no clear difference in the magnitude of β2 and β3. Therefore, both price increases and decreases have the similar effect of lengthening the duration of trades compared to when prices remain unchanged. In addition, March 20th was the only day when β2 was not significant, so the Bitcoin market on that day is thought to have shown unusual patterns.
Trade volume: β4
Both March and April's posterior means were a mixture of positive and negative signs, and they also contained both statistically significant and insignificant values. Even if considering only the significant values, their signs were a mixture of positive and negative signs. Therefore, we found that the impact of trading volume on trading interval duration differs depending on market conditions.
Price increase dummy × Trade volume: β5
This is the effect of the trading volume on the trading interval duration. The regression coefficients were nearly all positive, and 25 out of 31 days in March and 21 out of 30 days in April were significant. This shows that when the price rises in response to a trade, the trading interval duration tends to increase in line with the trading volume.
Price increase dummy × Trade volume: β6
In contrast to β5, this is the effect of the trading volume on the trading interval duration when the price moves in the negative direction. The regression coefficients were generally negative and significant. This indicates that when the price decreases due to a trade, the trading duration tends to become shorter as opposed to β5. This is similar to the leverage effect described in Section 1. Therefore, the Bitcoin market may also have a leverage effect in the same way as the stock markets.
Absolute log return × Trade volume: β7
Looking at the posterior mean, although there were many negative values, they were mixed with positive values, and furthermore, they were not statistically significant on all days. Since this regression coefficient can be interpreted as the absolute impact of a trade on the market, we found that the impact of each trade does not affect the trading interval duration.
Shape parameter: γ
This parameter is related to the shape of the error term ϵ. The values were generally close to γ=1 for all days in both March and April. When γ≤1, the mode of the Weibull distribution was zero; and when γ>1, the mode was (γ−1γ)1γ. In our setup, the mean and variance were calculated by using the gamma function Γ(⋅) as Γ(1+1γ) and Γ(1+2γ)−(Γ(1+1γ))2. When γ=1, both the mean and variance were 1.
AR(1) coefficient: ϕ
The minimum and maximum posterior means of ϕ in March and April were (0.747,0.946) and (0,705,0.912), respectively. These results indicate that the trading interval duration for Bitcoin was strongly persistent. This result is consistent with that of Nakakita and Nakatsuma (2021), who conducted a volatility analysis using stock index data, and Toyabe and Nakatsuma (2022), who conducted a duration analysis using individual stock data. Thus, the Bitcoin market appears to have characteristics similar to those of the stock market.
Variance: σ2
This parameter determines the variance of the AR(1) coefficient ϕ. Note that Tables 5 and 6 present σ instead of σ2, but the latter was generally found to converge to a value close to 1.
The figures do not indicate any commonality in the intraday seasonality, nor was there any uniformity in the order of dimensions chosen. Based on these findings, it is likely that Bitcoin does not have a common seasonality for each day. This may be because the Bitcoin market has no restrictions on trading hours. The inverse U-shaped seasonality observed by Toyabe and Nakatsuma (2022) can be attributed to the fact that the market for the analyzed financial instruments had opening and closing times, so there was a rush of orders immediately after the market opened and just before it closed. The Bitcoin market is open 24 hours a day, so time is not a factor when making orders. However, we extracted data only during the trading hours of the US stock market to examine its relationship with the Bitcoin market. Because we could not identify any seasonality common to all days, we found no clear relationship between the trading interval durations of the Bitcoin market and US stock market. In addition, there were no major differences from day to day, and no clear differences were observed between weekdays and weekends. As a result, whether the stock market was open or closed made no impact on the trade interval durations of the Bitcoin market.
5.
Discussion
In this section, we will discuss the similarities and differences with related literature, as well as future research prospects based on the results.
To the best of the authors' knowledge, there is no literature on the durations of trades in cryptocurrencies using tick data. Since research in this area is still in a developmental stage, there is potential to expand the scope of research, for example, by using other cryptocurrencies such as Ethereum or by using models other than the SCD models for estimation. Also, as mentioned earlier, the trading interval durations and volatility tend to move in conjunction. In our research, we found that the price/return and trading volume of Bitcoin are correlated with the trading interval durations. In our research, we found that the price/return and trading volume of Bitcoin are correlated with the trading interval durations. In contrast, while this research concluded that there is no intraday seasonality in the Bitcoin market, literature concludes that there is volatility in cryptocurrencies (e.g., Ben Omrane et al. (2024); Eross et al. (2019); Shanaev and Ghimire (2022); Su et al. (2022)). Furthermore, Eross et al. (2019) noted that the trading volume and volatility of Bitcoin against the US dollar and euro increased significantly during LNY hours (the hours when both the London and New York markets are open). Su et al. (2022) pointed out that the Bitcoin trading volume decreased from 4 p.m. to midnight. These findings were unrecognized in our research because we focused on the hours when the stock market was open. From these results, although there are issues with computing capacity, if we conduct an analysis without limiting the data, new discoveries may be obtained. In addition, while literature has shown that there are day-of-the-week effects on volatility (Aharon and Qadan (2019)) and within-the-month effects on pricing efficiency (Qadan et al. (2022)), there are no such effects on trading interval durations based on our results.
From those results, although the duration and volatility are seemingly linked, it is more natural to think of them as being driven by one or more third variable(s) rather than causality. In this research, we used variables related to return, price change, and volume, but other data can also be considered as candidates for explanatory variables. Also, there may be special phenomena in the cryptocurrency market, such as volatility cascades (e.g., Gradojevic and Tsiakas (2021); Brini and Lenz (2024)), where the trading interval durations have a chain-like effect. Furthermore, we can also explore ways of extracting data.
There are also other ways of collecting data that could be considered. This research analyzed the durations of all trades within the target period, but it is also possible to limit the trades themselves that are recorded. For instance, by limiting the data to only trades that occurred when a price change occurred, or only trades above a certain volume, it may be possible to obtain additional findings. In the field of volatility, there is a method that uses data that only records trades that represent significant price changes in the opposite direction of the trend, and its usefulness has been demonstrated (e.g., Tsang et al. (2016); Tsang (2021)).
6.
Conclusions
In this study, we used the SCD model to analyze trading interval durations in the Bitcoin market incorporating the trade price and trade volume as explanatory variables and including a Bernstein polynomial to consider the possibility of intraday seasonality and control for coincidental temporal trends. Our objective was to clarify the characteristics of Bitcoin and identify similarities and differences compared with other conventional financial assets such as stocks and commodities. Because the posterior distributions of the model parameters could be derived analytically, we used the MCMC method to estimate them numerically.
In the Bitcoin market, the relationship between price changes/return and trading interval duration was not observed. However, we found that positive relationships existed between the dummy variables for price increases/decreases and trade interval durations. In other words, the trading interval duration was longer in cases when the price increased or decreased compared to cases when the price did not change. In addition, on the one hand, the cross-term of the dummy variable for price increases and the trading volume also had a positive relationship with the trading interval, indicating that the trading interval was longer when the trading volume was larger in cases when the price increased. On the other hand, the cross-term of the dummy variable for price decreases and the trading volume was negatively related to the trading interval durations, showing that when prices fall, the larger the trading volume, the shorter the trading interval duration becomes.
This result is similar to the leverage effect commonly exhibited in financial markets. Note that while volatility is negatively correlated with the price decline, it was revealed that the duration of Bitcoin trades is affected by the trading volume when the price is negative rather than by the price decline itself. In addition, the effect of the trade volume itself was not statistically significant in many cases, and the effect was limited depending on the market conditions.
With regard to the period when the stock market is open, no common intraday seasonality was observed in the Bitcoin trading durations. Since the Bitcoin market is open 24 hours a day, 365 days a year, traders do not need to rush to place orders immediately after the opening or just before the closing of the stock or commodity markets. At the same time, however, the strong persistence typically seen in conventional financial assets was also observed in the Bitcoin market, so we conclude that the Bitcoin trading interval durations have both characteristics similar to and dissimilar from those of conventional financial assets.
A.
Appendix
A.1. Conditional posterior distributions
A.1.1. State variable α
The conditional posterior distribution of the state variable α is given by
To generate α from Equation (12), we apply the Metropolis–Hastings (MH) algorithm. To derive an appropriate proposal distribution for the MH algorithm, we can consider
in the neighborhood of α∗=[α∗1 … α∗n]′. Then,
where g(α) is the gradient vector of ℓ(α) and Q(α) is the Hessian matrix of ℓ(α) times −1. Then, we have
where
With the prior distribution of α∼Normal(μα,Σ−1α), the conditional posterior distribution of α can be approximated as
where C is a normalizing constant. By completing the square for the exponential of Equation (16) with respect to α, we obtain
where
Therefore, the right-hand side of Equation (16) is approximately proportional to
Then, by using Equation (18) as the proposal distribution, we can apply the MH algorithm to generate α. However, there are two practical issues that need to be addressed:
● An appropriate α∗ needs to be selected to make the approximation of Equation (16) practical.
● The MH algorithm tends to have a low acceptance rate because α is a high-dimensional vector.
To deal with the former problem, we set the mode of the conditional posterior probability as α∗. To search for the mode, the following recursive algorithm was employed:
Step 1: Initialize α∗(0), and set the counter r=1.
Step 2: Update α∗(r) by α∗(r)=μα(α∗(r−1)).
Step 3: Let r=r+1, and go to Step 2 unless max is less than the preset tolerance level.
This algorithm is equivalent to the Newton–Raphson method and thus converges in a few iterations. To solve the latter problem, we applied the block sampler proposed by Shepard and Pitt (1997) and Watanabe and Omori (2004), where the \boldsymbol{\alpha} is randomly divided into two subvectors (blocks). Then, the MH algorithm is applied to each subvector with one subvector as \boldsymbol{\alpha}_b and the other subvector as \boldsymbol{\alpha}_r :
By using Equations (18) and (19), the kernel of the conditional distribution p (\boldsymbol{\alpha}_b | \boldsymbol{\alpha}_r) is given by
By completing the square of Equation (20), we get
where
Therefore, the conditional distribution of \boldsymbol{\alpha}_b given by \boldsymbol{\alpha}_r in Equation (20) is rearranged as
We applied the MH algorithm by dividing \boldsymbol{\alpha} into a random number of blocks from 20 to 40 for each iteration of the sampling scheme.
A.1.2. Regression coefficient \boldsymbol{\beta}
The sampling strategy for the regression coefficient \boldsymbol{\beta} is almost identical to the one for the state variable \boldsymbol{\alpha} . Let \ell (\boldsymbol{\beta}) denote \log p (\boldsymbol{y} | \theta, X) . In the same manner as Equation (13), the second-order Taylor approximation of \ell (\boldsymbol{\beta}) in the neighborhood of \boldsymbol{\beta}^\ast is given by
where \boldsymbol{g} (\boldsymbol{\beta}) is the gradient vector of \ell (\boldsymbol{\beta}) and Q(\boldsymbol{\beta}) is the Hessian matrix of \ell (\boldsymbol{\beta}) times -1 . Then, we obtain
where
With the prior distribution of \boldsymbol{\beta} \sim \text{Normal} \left(\boldsymbol{\mu}_\beta, \Sigma^{-1}_\beta \right) , the conditional posterior distribution of \boldsymbol{\beta} can be approximated by
where C is a normalizing constant. By completing the square for the exponential of Equation (24) with respect to \boldsymbol{\beta} , we obtain
where
Therefore, the right-hand side of Equation (24) is approximately proportional to
The search algorithm for \boldsymbol{\beta}^\ast is the same as for \boldsymbol{\alpha} . Contrary to \boldsymbol{\alpha} , however, \boldsymbol{\beta} has a small number of dimensions, so the block sampler is not applied.
A.1.3. Shape parameter \gamma
The sampling strategy for the shape parameter \gamma is almost the same as for \boldsymbol{\alpha} and \boldsymbol{\beta} . Because the prior distribution of \gamma is not a normal distribution, we consider the log conditional posterior distribution of \gamma instead of Equation (13). Let
Then, the second-order Taylor expansion of Equation (27) with respect to \gamma in the neighborhood of \gamma^\ast > 0 is given by
where
Here, q(\gamma^\ast) is positive for any \gamma^\ast > 0 if n+a_\gamma > 1 .
By completing the square of Equation (27), we get the proposed distribution
where
Because of the global concavity of f(\gamma) , if we use the mode of f(\gamma) as \gamma^\ast , g(\gamma^\ast) = 0 always holds. Therefore, \mu_\gamma (\gamma^\ast) is practically the same as \gamma^\ast .
A.1.4. AR(1) coefficient \phi
Once the state variable \boldsymbol{\alpha} is generated, the conditional posterior distribution of \phi is given by
By completing the square for the exponential, we obtain
With the above expression in mind, we can use the following normal distribution as the proposal distribution for \phi in the MH algorithm:
A.1.5. Variance \sigma^2
Because we used the standard conditionally conjugate prior distribution for \sigma^2 , the conditional posterior distribution of \sigma^2 is given by
Because Equation (32) is the kernel of the posterior distribution of \sigma^2 , it is derived as
A.2. Centered parametrization (CP) form
Appendix A.1 presents the conditional posterior distributions for the model parameters. Although these distributions can be estimated by the MCMC method, the generated samples would have strong autocorrelation, which would result in low sampling efficiency. To improve the sampling efficiency, we employed the ancillarity-sufficiency interweaving strategy (ASIS) proposed by Yu and Meng (2011), which is an algorithm that samples from two different but essentially equal models of parameterizations. In this study, we set one model as the original model in Appendix A.1 and the other model is to be centered on the missing/latent variables of the original model (i.e., \boldsymbol{\alpha} ). We defined the original SCD model in Equations (1) and (2) as the non-centered parameterization (NCP) form and the centered model as the centered parameterization (CP) form.
We can implement the following conversion with respect to \alpha_i :
Then, the SCD model in Equations (1) and (2) becomes
The likelihood for the SCD model in Equations (35) and (36) given the observation \boldsymbol{y} and latent variable \tilde{\boldsymbol{\alpha}} is expressed as
where
Then, the joint posterior distribution of \theta for Equations (35) and (36) is given by
As with the NCP form, because the joint posterior distribution in Equation (41) cannot be obtained analytically, we employed the MCMC method to estimate it numerically. Then, let us derive the conditional posterior distributions of the parameters in order to perform MCMC with CP.
A.3. CP forms of the conditional posterior distributions
A.3.1. Regression coefficient \boldsymbol{\beta}
The conditional posterior distribution of \beta is given by
By completing the square for the exponential with respect to \boldsymbol{\beta} , we obtain
From Equation (42), the conditional posterior distribution of \boldsymbol{\beta} is given by
where
A.3.2. Shape parameter \gamma
By replacing \alpha_i + \boldsymbol{x}'_i \boldsymbol{\beta} with \tilde{\alpha_i} , Equations (6) and (27) become
Then, the second-order Taylor expansion of Equation (27) with respect to \gamma in the neighborhood of \gamma^\ast > 0 becomes
where
By completing the square of Equation (46), we obtain the following proposal distribution:
where
for sampling \gamma by the MH algorithm.
A.3.3. AR(1) coefficient \phi
By replacing \alpha_i with \tilde{\alpha}_i - \boldsymbol{x}'_i \boldsymbol{\beta} in Equation (31), we obtain
where
is the proposal distribution for \phi in the MH algorithm.
A.3.4. Variance \sigma^2 (CP form)
By replacing \alpha_i with \tilde{\alpha}_i - \boldsymbol{x}'_i \boldsymbol{\beta} in Equation (33), we obtain
as the conditional posterior distribution of \sigma^2 .
A.4. ASIS Application to MCMC
ASIS is simple to apply: the NCP and CP sampling schemes are interweaved alternately. Note that the random number generation for sampling is adopted for only one of the NCP and CP forms. Because the correct statistical estimation is possible regardless of which form is chosen, we applied the random number generation for sampling to the CP form.
Outline of the sampling scheme (ASIS)
Step 0: Initialize (\boldsymbol{\alpha}^{(0)}, \boldsymbol{\beta}^{(0)}, \gamma^{(0)}, \phi^{(0)}, \sigma^{2(0)}) and set the counter r = 0 .
Step 1.1: Generate \boldsymbol{\alpha}^{(r+1)} from p(\boldsymbol{\alpha} | \boldsymbol{\beta}^{(r)}, \gamma^{(r)}, \phi^{(r)}, \sigma^{2(r)}, \boldsymbol{y}, X) from Equation (18).
Step 1.2: Generate \boldsymbol{\beta}^{(r+0.5)} from p(\boldsymbol{\beta} | \boldsymbol{\alpha}^{(r+1)}, \gamma^{(r)}, \phi^{(r)}, \sigma^{2(r)}, \boldsymbol{y}, X) from Equation (26).
Step 1.3: Generate {\gamma}^{(r+0.5)} from p({\gamma} | \boldsymbol{\alpha}^{(r+1)}, \boldsymbol{\beta}^{(r+0.5)}, \phi^{(r)}, \sigma^{2(r)}, \boldsymbol{y}, X) from Equation (29).
Step 1.4: Generate {\phi}^{(r+0.5)} from p({\phi} | \boldsymbol{\alpha}^{(r+1)}, \boldsymbol{\beta}^{(r+0.5)}, \gamma^{(r+0.5)}, \sigma^{2(r)}, \boldsymbol{y}, X) from Equation (31).
Step 1.5: Generate {\sigma}^{2(r+0.5)} from p({\sigma^2} | \boldsymbol{\alpha}^{(r+1)}, \boldsymbol{\beta}^{(r+0.5)}, \gamma^{(r+0.5)}, \phi^{(r+0.5)}, \boldsymbol{y}, X) from Equation (33).
Step 2.1: Calculate \tilde{\boldsymbol{\alpha}}^{(r+1)} = \boldsymbol{\alpha}^{(r+1)} - X \boldsymbol{\beta}^{(r+0.5)} .
Step 2.2: Generate \boldsymbol{\beta}^{(r+1)} from p({\boldsymbol{\beta}} | \tilde{\boldsymbol{\alpha}}^{(r)}, \gamma^{(r+0.5)}, \phi^{(r+0.5)}, \sigma^{2(r+0.5)}, \boldsymbol{y}, X) from Equation (43).
Step 2.3: Generate {\gamma}^{(r+1)} from p({\gamma} | \tilde{\boldsymbol{\alpha}}^{(r+1)}, \boldsymbol{\beta}^{(r+1)}, \phi^{(r+0.5)}, \sigma^{2(r+0.5)}, \boldsymbol{y}, X) from Equation (47).
Step 2.4: Generate {\phi}^{(r+1)} from p({\phi} | \tilde{\boldsymbol{\alpha}}^{(r+1)}, \boldsymbol{\beta}^{(r+1)}, \gamma^{(r+1)}, \sigma^{2(r+0.5)}, \boldsymbol{y}, X) from Equation (48).
Step 2.5: Generate {\sigma}^{2(r+1)} from p({\sigma^2} | \tilde{\boldsymbol{\alpha}}^{(r+1)}, \boldsymbol{\beta}^{(r+1)}, \gamma^{(r+1)}, \phi^{(r+1)}, \boldsymbol{y}, X) from Equation (49).
Step 3: Let r = r+1 , and go to Step 1-1 until the burn-in iterations are completed.
Step 4: Reset the counter r = 0 , and repeat Step 1.1–2.5 R times to obtain the Monte Carlo sample (\boldsymbol{\alpha}^{(r)}, \boldsymbol{\beta}^{(r)}, \gamma^{(r)}, \phi^{(r)}, \sigma^{2(r)})_{r = 1}^R .
Author contributions
Makoto Nakakita: Conceptualization, Software, Data curation, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing.
Teruo Nakatsuma: Conceptualization, Software, Resources, Supervision, Funding acquisition, Methodology, Writing – review and editing, Project administration.
Use of AI tools declaration
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
Acknowledgement
This research was funded by JSPS KAKENHI Grant Numbers JP23K18819, JP20H00088.
Conflict of interest
All authors declare no conflicts of interest in this paper.