Research article

Impacts on vegetable yields, nutrient contents and soil fertility in a community garden with different compost amendments

  • This study aimed to test impacts on soil fertility, plant yield, and plant nutrient content when growing vegetables (Arugula and Radish) at different compost treatments rates (10%, 30%, 50% and 70% v/v) and with synthetic fertilizer. The compost used in this study was produced from food wastes in combination of wood chips. The results showed the impacts on vegetable growth and soil fertility varied exceptionally by the compost amendment rate. Specifically, the leafy crop experienced an increased yield with the incremented compost ratio and therefore the highest treatment (70%) generated a harvest several times larger to that of non-treated soil. For the root vegetable, the largest output was observed at a medium treatment rate (50%). Additionally, the applications revealed compost treatments at high percentages generally promoted elements N, P, K, Na, Mn, Zn and Mg within the vegetable contents. On the contrary, a low compost amount (10%) boosted Ca, Al, and Fe levels. In terms of soil fertility enrichment, the compost can improve C, N, K and Zn at medium to high treatment rates (30% to 70%). Particularly, at such amounts, the compost enhanced C and N contents within the ground soil more than the fertilizer application. Based on the gathered outcomes, root vegetables will thrive at 50% compost treatment allowing for the replacement of complete synthetic fertilizer use without significant reduction on yields and nutrients. As for leafy green vegetables, the 70 % compost concentration permits the replacement of more than half the total fertilizer usage.

    Citation: Dongyan Mu, John Hawks, Andrew Diaz. Impacts on vegetable yields, nutrient contents and soil fertility in a community garden with different compost amendments[J]. AIMS Environmental Science, 2020, 7(4): 350-365. doi: 10.3934/environsci.2020023

    Related Papers:

    [1] Roa Makki . Some characterizations of non-null rectifying curves in dual Lorentzian 3-space D31. AIMS Mathematics, 2021, 6(3): 2114-2131. doi: 10.3934/math.2021129
    [2] Semra Kaya Nurkan, İlkay Arslan Güven . Construction of vectorial moments via direction curves. AIMS Mathematics, 2023, 8(6): 12857-12871. doi: 10.3934/math.2023648
    [3] Sezai Kızıltuǧ, Tülay Erişir, Gökhan Mumcu, Yusuf Yaylı . C-partner curves with modified adapted frame and their applications. AIMS Mathematics, 2023, 8(1): 1345-1359. doi: 10.3934/math.2023067
    [4] Areej A. Almoneef, Rashad A. Abdel-Baky . Surface family pair with Bertrand pair as mutual geodesic curves in Euclidean 3-space E3. AIMS Mathematics, 2023, 8(9): 20546-20560. doi: 10.3934/math.20231047
    [5] Bahar UYAR DÜLDÜL . On some new frames along a space curve and integral curves with Darboux q-vector fields in E3. AIMS Mathematics, 2024, 9(7): 17871-17885. doi: 10.3934/math.2024869
    [6] Ayman Elsharkawy, Clemente Cesarano, Abdelrhman Tawfiq, Abdul Aziz Ismail . The non-linear Schrödinger equation associated with the soliton surfaces in Minkowski 3-space. AIMS Mathematics, 2022, 7(10): 17879-17893. doi: 10.3934/math.2022985
    [7] Faik Babadağ . A new approach to Jacobsthal, Jacobsthal-Lucas numbers and dual vectors. AIMS Mathematics, 2023, 8(8): 18596-18606. doi: 10.3934/math.2023946
    [8] Nural Yüksel . On dual surfaces in Galilean 3-space. AIMS Mathematics, 2023, 8(2): 4830-4842. doi: 10.3934/math.2023240
    [9] Yanlin Li, Kemal Eren, Kebire Hilal Ayvacı, Soley Ersoy . The developable surfaces with pointwise 1-type Gauss map of Frenet type framed base curves in Euclidean 3-space. AIMS Mathematics, 2023, 8(1): 2226-2239. doi: 10.3934/math.2023115
    [10] Haiming Liu, Jiajing Miao . Geometric invariants and focal surfaces of spacelike curves in de Sitter space from a caustic viewpoint. AIMS Mathematics, 2021, 6(4): 3177-3204. doi: 10.3934/math.2021192
  • This study aimed to test impacts on soil fertility, plant yield, and plant nutrient content when growing vegetables (Arugula and Radish) at different compost treatments rates (10%, 30%, 50% and 70% v/v) and with synthetic fertilizer. The compost used in this study was produced from food wastes in combination of wood chips. The results showed the impacts on vegetable growth and soil fertility varied exceptionally by the compost amendment rate. Specifically, the leafy crop experienced an increased yield with the incremented compost ratio and therefore the highest treatment (70%) generated a harvest several times larger to that of non-treated soil. For the root vegetable, the largest output was observed at a medium treatment rate (50%). Additionally, the applications revealed compost treatments at high percentages generally promoted elements N, P, K, Na, Mn, Zn and Mg within the vegetable contents. On the contrary, a low compost amount (10%) boosted Ca, Al, and Fe levels. In terms of soil fertility enrichment, the compost can improve C, N, K and Zn at medium to high treatment rates (30% to 70%). Particularly, at such amounts, the compost enhanced C and N contents within the ground soil more than the fertilizer application. Based on the gathered outcomes, root vegetables will thrive at 50% compost treatment allowing for the replacement of complete synthetic fertilizer use without significant reduction on yields and nutrients. As for leafy green vegetables, the 70 % compost concentration permits the replacement of more than half the total fertilizer usage.


    1. Introduction

    With the goal of developing early warning signals of future crises, many economic indicators have been analyzed in the aftermath of the 2007--2009 financial crisis. This literature has its beginnings in the prediction of stock price corrections, and by now includes measures such as credit-to-GDP growth, the debt service ratio, and gaps of property prices (Gençay and Gradojevic, 2010; Drehmann and Juselius, 2014). Attempts to estimate large scale deviations from equilibrium relations in housing markets often rely on ad-hoc rules that follow the application of the Hodrick-Prescott filter (Jordà et al., 2015). In contrast, the use of formal statistical tests, based on precise definitions of what a large deviation (i.e. a bubble) is, has been very limited. In addition, the studies that actually apply these techniques tend to presuppose rather than to examine the validity of the tests (Pavlidis et al., 2013). Thus, the question of whether statistical tests for bubble detection can be used to practically identify bubbles in housing markets remains largely unexplored*.

    *Although the Federal Reserve Bank of Dallas regularly applies the Phillips test to international markets, there are no formal assessments regarding the performance of this procedure in forecasting the tipping point of past bubbles (Mack et al., 2011).

    The assessment of these tests is relevant not only within the context of early warning systems, but also for any attempt to empirically study the causes and consequences of bubble regimes (Sarno and Taylor, 2003). An inappropriate definition of bubbles or a flawed bubble identification strategy might rule out, even in hindsight, valid and alternative interpretations of the data. If crashes are for example used to identify bubbles, bubble regimes are likely to be associated with poor economic performance, as only episodes with crashes, and potentially very evident ones, will enter the sample of the study (a typical case of selection bias). On the contrary, bubble identification strategies that do not assume a crash or economic decline as consequences of a bubble, have already yielded empirical evidence that asset price bubbles can be a welfare enhancing phenomenon (Narayan et al., 2016; Shi, 2017). Let us also mention the "social bubble hypothesis" (Gisler and Sornette, 2009, 2010; Gisler et al., 2011), which describes the existence of beneficial bubbles fostering innovations, during which strong social interactions between enthusiastic supporters weave a network of reinforcing feedbacks that lead to widespread endorsement and extraordinary commitment by those involved, beyond what would be rationalized by a standard cost-benefit analysis.

    The present paper analyzes the information content of statistical tests for bubble detection in the context of international real estate markets. We evaluate the ex-ante performance of five different indicators derived from three types of statistical tests for bubble detection: 1) two tests based on the identification of super-exponential trends; 2) one test based on the detection of mildly explosive regimes; 3) two tests based on the scaled ratio of forecasting errors. We mainly compare the indicators individually, though towards the end of the paper we also analyze simple rules to combine them.

    Following the literature on early warning signals (Drehmann and Juselius, 2014; Edison, 2003), we study the performance of the bubble indicators in the forecasting of tipping points of real estate bubbles and systemic financial crises. We run logit regressions having the indicators as dependent variables, and evaluate the performance of the regressions via the statistical significance of the area of the receiver operating characteristic curve (AUROC statistic). Our dependent variable consists of a binary indicator for the dates of the bubbles' tipping points and systemic banking crises. The dates of the bubbles' tipping points were obtained from a literature review on drivers of house prices. In total, we reviewed 19 single-country, 3 bi-country, and 3 multi-country studies to identify the periods of extreme overvaluation. We argue that our dating approach is more objective than any other technical rule previously employed in the literature, as such rules do not take into account the particularities of each housing market and depend on strong (and often symmetric) corrections to identify the end of a bubble period. The dates of systemic banking crises are those reported by Laeven and Valencia (2008), Valencia and Laeven (2012) and Drehmann et al. (2012) for the countries in our sample.

    We emphasize that, although we employ the methodologies of the literature on early warning signals, our aim, more generally, is to contribute to the understanding of the tools available to identify and analyze bubble regimes. Thus, we are not working on an early warning signal system because this would require endogenizing the cost of false positives as well as false negatives, which necessarily depends on the applications and circumstances.

    Overall, our results suggest that the statistical tests for bubble detection contain significant in-sample and out-of-sample forecasting power. All tests present significant in-sample performance, whereas three out of the five indicators present significant out-of-sample forecasting power. We contribute to the literature on early warning indicators by documenting and comparing the forecasting skill of these statistical methods for bubble detection. Our results also emphasize the importance of identifying and understanding the consequences of periods of explosive price development, as they directly link statistical exuberance to historic episodes of strong deviations from fundamental factors. In addition, the fact that the supLPPLS and supPL contain significant value relative to the other approaches highlights the need to further employ diagnosis that are sensitive to the super-exponential growth pattern, which is increasingly supported as being a key stylized fact of financial and housing bubbles (Hüsler et al., 2013; Leiss et al., 2015; Gjerstad and Smith, 2014). Finally, our results give insights on the use of price-to-rent and price-to-income ratios, which have been broadly employed to diagnose deviations from fundamental values (Bourassa et al., 2016). We find that, although the ratios indeed contain valuable information related to the bubble, the ex-ante power of the statistical tests, when applied to price-to-income and price-to-rent ratios, does not seem sufficient to ex-ante identify the bubble regime.

    This paper is organized as follows: section 2 describes the sample set and the chronology of bubbles. Section 3 presents the bubble detection tests from which our indicators are derived. Section 4 explains the methodology to construct and assess the indicators. Section 5 discusses the results. Section 6 presents the conclusions.


    2. Database and the chronology of bubbles

    Our sample comprises 18 countries, with quarterly data between 1975Q1 and 2013Q3. These economies were selected based on relevance and availability. We mostly rely on the house indices in (Mack et al., 2011), though we also use the BIS database (Scatigna et al., 2015) to obtain the real house prices from two countries from the Asia-pacific region: Thailand and South Korea. These two countries are important as Asian countries suffered a systemic financial crises between 1997 and 1998, which stopped the real estate booms taking place in these economies. Thus, their real estate cycles remained disconnected from the last housing boom of the US and Europe, and significantly enrich the information in our sample. All series are in real terms (deflated by CPI), and normalized by their respective value in 2005Q1 to ensure comparability of the units.

    Excluding Thailand, for which the time series starts in 1990Q4.

    To compare our results against existing benchmarks used in the literature of early warning signals, we gathered the countries' (log) price-to-income and (log) price-to-rent time series from the OECD database.

    Prices: Analytical house price indicators http://www.oecd-ilibrary.org/economics/data/prices/prices-analytical-house-price-indicators-edition-2016-1_6f6a769e-en. When using these ratios, we had to exclude Thailand, as this country is not part of the OECD.


    2.1. Bubble tipping points

    In order to identify the bubble tipping points, we surveyed the literature on drivers of house prices. Empirical detection of bubbles poses several challenges as there is no general theoretical or empirical definition of a bubble (Case and Shiller, 2003). Furthermore, hindsight might reveal or obscure the presence of bubble episodes. A crash might be regarded as evidence in favor of a bubble. On the contrary, absence of a strong correction following a protracted boom might lead scholars to incorrectly conclude that a period of exuberance was indeed justified by unrecognized fundamental factors.

    Hence, to conduct this study and contrary to the vast majority of indicators used in the literature of macroeconomic crises (see e.g. Jordà et al., 2015), we opt to identify bubbles by compiling the results from 25 different studies based on a fundamental analysis of prices (19 single-country, 3 bi-country, and 3 multi-country studies). These studies are typically based on inverted demand equations or life cycle consumptions models (Anundsen, 2015), and include variables such as interest rate, property tax, housing stock, disposable income, among others. We argue that our identification strategy provides us with a more objective benchmark against which to evaluate the performance of the indicators as fundamental-based studies take into consideration the internal macroeconomic conditions of the countries, especially those studies associated specifically to a single country.

    Admittedly, our survey is far from perfect as bubbles remain a contentious concept. The identification strategy is based only on articles that interpret deviations from fundamentals as signs of bubbles. Their primary assumption - unless expectations are explicitly modeled - is that the lagged appreciation unexplained by the fundamental variables represents speculative pressures. However, it is fair to acknowledge that many studies identify such deviations without explicitly referring to speculative or exuberant periods. An alternative but not necessarily incompatible explanation is that real house prices have a tendency to overshoot following a shock to the market. As a result, observed house prices are likely to deviate from the long-run equilibrium level. Another issue typically raised is that the identification strategy might be subject to misspecification, as the econometrician has no way to know the "true" model (Flood and Hodrick, 1990).

    The studies seldom report explicitly the peak of the bubble. Therefore, we chose the peak of the overvaluation period as the variable of analysis. In total, we identified 19 housing bubbles over the two long real estate cycles that the world has experienced plus the housing bubbles in Asia. Table 1 presents our complete list of tipping points matched to their original sources. The table includes the real log returns four years before and after the tipping point (Δ4 and Δ4 respectively), as well as the ratio of these two quantities (Δ4Δ4). The vertical lines in the subplots in the table represent the peaks of the bubbles.

    Table 1. Summary of 19 bubble peaks in 15 countries according to studies surveyed. The table shows the years of the peaks, the real log returns four years before and after the peaks (Δ4 and Δ4 respectively), the real house price development around these years, and the ratio Δ4Δ4. The vertical lines in the small figures denote the bubble peaks. No peaks are reported for Germany, Italy, and Thailand as Nobili and Zollino (2012) and Glindro et al. (2011) concluded that prices in these countries have evolved (mostly) according to the fundamentals.
    CountryPeak tpΔ4Δ4Δ4Δ4lnpt
    [tp4,tp+4]
    Reference
    Australia 2003 0.44 0.14 0.31 Glindro et al. (2011)
    Berry and Dalton (2004)
    Canada 2008 0.31 0.1 0.29 Walks (2014)
    Denmark 2006 0.33 -0.08 -0.25 Sørensen (2013)
    Dam et al. (2011)
    France 2008 0.35 -0.02 -0.06 Antipa and Lecat (2010)
    Ireland 2002 0.42 0.39 -0.93 Connor et al. (2012)
    2007 0.39 -0.33 -0.87 Stevenson (2008)
    Hott and Monnin (2008)
    Japan 1991 0.3 -0.3 -0.97 Hott and Monnin (2008)
    Barsky (2009)
    Netherlands 2006 0.08 0.01 0.075 Hott and Monnin (2008)
    Francke et al. (2009)
    Norway 1989 0.43 -0.32 -0.75 Anundsen and Jansen (2011)
    Jacobsen and Naug (2005)
    S. Africa 2008 0.47 -0.16 -0.34 Das et al. (2011)
    S. Korea 1991 0.23 -0.41 -1.74 Glindro et al. (2011)
    Kim and Min (2011)
    Spain 1992 0.42 -0.19 -0.46 Ayuso and Restoy (2006)
    2007 0.42 -0.18 -0.42 Neal and García-Iglesias (2013)
    Antipa and Lecat (2010)
    Hort (1998)
    Sweden 1990 0.29 -0.33 -1.13 Andreas Claussen (2013)
    Sørensen (2013)
    Switzerland 1990 0.30 -0.29 0.97 Hott and Monnin (2008)
    2013 0.19 - - Hott (2012)
    Muellbauer and Murphy (1997)
    UK 1990 0.5 -0.29 0.58 Cameron et al. (2006)
    2005 0.43 0.01 0.14 Black et al. (2006)
    Hott and Monnin (2008)
    Hott and Monnin (2008)
    US 2006 0.22 -0.22 -1 Mikhed and Zemčík(2009)
    Anundsen (2015)
     | Show Table
    DownLoad: CSV

    A salient feature is the asymmetric behavior of booms and bursts, in which the former tend to be more pronounced than the latter. Only five countries, Ireland (2007), Japan, Sweden, Switzerland, and the US, exhibit symmetric boom-bust behaviors. Four bubbles, those in Canada, France, the Netherlands, and UK (2005), present almost constant price development four years after the peak when computed in real terms. House prices in Ireland (2002) quickly recovered after the bubble-burst episode. In the Netherlands and Denmark, prices continued rising after the peak of the overvaluation so that the peak does not coincide with a local maximum in the price level, illustrating the fact that adjustments in fundamental factors, such as a drop in the interest rate, might also align prices with fundamentals and prevent a visible correction from happening. The existence of such patterns challenges identification strategies based on local maxima. Bubbles do not always suddenly burst, the end phase of the bubble can follow very different dynamics, and as Brunnermeier and Oehmke (2012) have remarked, we do not know enough yet about how and why prices evolve the way they do once the expansion phase of the real estate cycle has passed.


    2.2. Crises

    The dates of systemic crises that we take are the years with a beginning of a systemic banking crises, reported by Laeven and Valencia (2008), Valencia and Laeven (2012) and Drehmann et al. (2012) for the countries in our sample. In case of discrepancies, we choose the earliest reported year among these two sources. Table 2 presents the actual values. Canada and South Africa do not appear in the table as there are no crises reported for these two countries.

    Table 2. Systemic banking crises. The table reports the years with a beginning of a systemic banking crises. For the countries in our sample, we consolidated the dates reported by Laeven and Valencia (2008), Valencia and Laeven (2012) and Drehmann et al. (2012). In case of discrepancies, we chose the earliest reported year. Canada and South Africa do not appear with crisis events.
    Country Year crisis Country Year crisis
    Australia 1989 Norway 1990
    Denmark 2008 S. Korea 1997
    Germany 2007 Spain 2008
    France 2008 Sweden 1991, 2008
    Ireland 2008 Switzerland 2008
    Italy 2008 Thailand 1983, 1997
    Japan 1997 UK 1990, 2007
    Netherlands 2008 USA 1988, 2007
     | Show Table
    DownLoad: CSV

    3. Bubble detection tests

    In this section, we present several bubble detection tests proposed in the literature of asset bubbles. We are interested in identifying a bubble in a log prices time series lnpt with t=1,...,[τT],[τT]+1,...,T, τ(0,1) and [τT] denoting the greatest integer smaller than or equal to τT. [τT] corresponds to the starting period in which the bubble is detected.

    The tests can be motivated within the theory of rational expectations bubbles (REB). The departure point of REB is the familiar no arbitrage condition for the price of an asset,

    Pt=11+REt(Pt+1+Dt+1) (1)

    where Pt is the asset price at time t, Dt is the dividend received from the asset for ownership between t1 and t (i.e. the inputed rent in a real estate context), R>0 is the discount rate, and Et() denotes the expectation conditional on the information at time t. A log linear approximation of equation 1 and forward iteration yields (Campbell and Shiller, 1988; Phillips et al., 2011),

    pt=pft+bt , (2)

    where

    pft=κγ1ρ+(1ρ)t=1ρiEt(dt+1+i) (3)
    bt=limiρiEt(pt+i) (4)
    Et(bt+1)=1ρbt=(1+exp(¯dp))bt (5)

    with pt=lnPt, dt=lnDt, γ=ln(1+R), ρ=1/(1+exp(¯dp)), ¯dp equals the average log dividend price ratio, and κ=lnρ(1ρ)ln(1ρ1). pft is called the fundamental component of the price, and bt the rational bubble component. In the absence of a bubble (i.e. bt=0), pt=pft and pt is solely determined by the dividends. Conversely, if bt0, equation 5 implies the following process,

    bt=1ρbt1+ϵb,t=(1+g)bt1+ϵb,t ,     Et1(ϵb,t)=0 (6)

    where g=1ρ1=exp(¯dp) is the growth rate of the natural logarithm of the bubble and ϵb,t is a martingale difference. Since g>0, equation 5 implies that bt is a sub-martingale, and equation 6 states that bubbles, if they are present, should manifest explosive characteristic in log prices.

    The rational expectations bubble theory has several drawbacks. Models based on REB provide conditions for an economic equilibrium with bubbles, but they do not explain the dynamics that lead to the boom. In other words, bubbles in these models do not emerge, but they exist or appear randomly and it is unclear when/under what circumstances a market moves from a stable to an unstable regime. Rational expectation bubbles, if they exist, must be positive, infinitely lived, and hence, require assets with infinite maturity (Blanchard and Watson, 1982). These properties are at odds with empirical evidence such as, for example, the bubble on Chinese call warrants discussed by Palan (2013). Giglio et al. (2014) exclude infinitely lived bubbles on the housing markets of Singapore and the U.K§. Scheinkman and Xiong (2004) argue that models of rational bubbles are incapable of explaining the increase in trading volume that is typically observed in the historic bubble periods. Lux and Sornette (2002) demonstrate that exogenous rational bubbles are not compatible with some of the stylized facts of financial data at a very elementary level.

    §According to the authors, these markets give the best chances of detecting a bubble in the data.

    While these limitations are serious, there exists versions of REB that attempt to circumvent them, such as those developed in (Johansen et al., 2000) and (Lin and Sornette, 2013) for instance. Sornette (2002) proposed a resolution of the objections raised by Lux and Sornette (2002). We thus use below the REB theory as a common framework to illustrate the main differences among the bubble tests. At this point, it is worth stressing that the analysis of bubbles remains incomplete, and their research requires pragmatic choices as well as a constant dialogue between any proposed theory and the supporting empirical studies. In this vein, let us mention the framework recently advanced in (Schatz and Sornette, 2018) for explosive semi-martingales, which generalises the REB theory and is based on the antagonistic combination of (ⅰ) an excessive, unstable pre-crash process and (ⅱ) a drawdown starting at some random time.


    3.1. Super-exponential based procedure

    The first type of bubble detection test is based on the identification of a transient super-exponential trend in the dynamics of the log prices. This kind of dynamics was first proposed on the basis of empirical observations in (Sornette et al., 1996; Feigenbaum and Freund, 1996). It was later justified by Johansen et al. (2000) and Johansen and Sornette (1999) within the framework of rational expectation model of bubbles. From a theoretical view point, the authors argue that the no arbitrage condition, together with a hierarchical self-reinforcing organization of the market, and the need of investors to be compensated for the risk of the crash, generates a power law finite-time singular price dynamics as the bubble approaches its end. As a result of the positive feedbacks, such super-exponential dynamics is unsustainable as it ends in a finite time singularity, which signals a change of regime (the end of the bubble).

    Statistically, a bubble can thus be identified via a non-nested hypothesis test of model selection (Davidson and MacKinnon, 1981). If there is enough evidence to reject the null hypothesis of a non-explosive process in favor of the alternative super-exponential trend, the bubble hypothesis can be supported. Specifically, we compare a stationary AR(1) process in the log returns against Δln^ptsexp, the log returns predicted by a fitted super-exponential trend in the subsample between [τT] and T:

    Δlnpt=ρΔlnpt1+αsexpΔln^ptsexp+ϵt,for t=[τT],[τT]+1,...,T (7)

    where ϵt is a white noise process. The null hypothesis of no bubble after [τT] period is rejected if the t-statistic ˆtαsexp for the estimate of αsexp exceeds the corresponding critical value. When the starting date of the super-exponential trend is not known, the statistic takes the following form:

    As a remark, ρ1 would also suggest super-exponential behavior, but we chose not to test this alternative to focus on the super-exponential trend.

    supSEXP(τ0)=supτ[0,τ0]ˆtαsexp (8)

    where τ0 determines the interval in which the super exponential trend is tested.

    We employ two different super-exponential specifications to obtain Δln^ptsexp. First, the fitted values given by the log periodic power law singularity (LPPLS) model:

    lnptsexp=A+(tct)m[B+Ccos(ωln(tct)ϕ)] (9)

    where 0<m<1, B<0, 3<ω<15, and |C|(ω2+m2)1/2|B|m (Filimonov and Sornette, 2013). tc corresponds to the non-random time of the termination of the bubble. As calibration of equation 9 on quarterly data can be difficult due to the low frequency of the volatility of house prices and the relatively large number of parameters (7 in total, 3 nonlinear, 4 linear after the reformulation in (Filimonov and Sornette, 2013)), we also explore a simplification of the LPPLS model that excludes the log periodic oscillations (hereafter PL model):

    lnptsexp=A+B(tct)m (10)

    where as in equation 9, 0<m<1 and B<0. These last two conditions ensure that the instantaneous expected return diverges at tc. In practice, it does not of course, but the hypothesis is that the average price trajectory can be approximated over a time interval until close to its turning point by such a process with increasing returns.


    3.2. The Kim-Busetti-Taylor statistic

    In the case where dt is stationary (i.e. I(0)), as it is observed for the inputed rent of a real estate asset, absence of bubbles implies by equation 2 that Δpt should also be stationary. Hence, rejection of the stationarity of Δpt in favor of the non-stationarity hypothesis can be interpreted as evidence of bubbles. In this context, Homm and Breitung (2012) employed the tests of Kim (2000) and Busetti and Taylor (2004) for a bubble detection setup.

    Kim (2000) and Busetti and Taylor (2004) proposed independently a statistic for testing the null hypothesis that a stationary time series switches to a process with higher persistence (e.g. from I(0) to I(1)). Their focus is on the Gaussian unobserved components model,

    yt=βt+μt+ϵt    for t=1,2,...,T (11)
    μt=μt1+1(t>[τT]ηt)    for τ(0,1) (12)

    where 1() is the indicator function, ϵt and ηt are mutually independent mean zero IID Gaussian processes with variance σ2 and σ2n respectively, and βt is a deterministic component that we take as constant. Equations 11 and 12 describe a process that is stationary up to [τT], but is I(1) after the break if and only if σ2n>0. Consequently, a test for stationarity against the non-stationary hypothesis can be framed as testing the null hypothesis

    H0:σ2n=0 (13)

    against the alternative,

    H1:σ2n>0 (14)

    A test statistic that rejects (13) for large values is,

    KBTτ=[(1τ)T]2Tt=[τT]+1(ti=[τT]+1ˆϵ1,i)2[τT]2[τT]t=1(ti=1ˆϵ0,i)2 (15)

    where ˆϵ0,i are the OLS residuals from the regression of yt on an intercept in t=1,...,[τT], and ˆϵ1,i are the OLS residuals from the regression of yt on an intercept in t=[τT]+1,...,T. Equation 15 is a Chow-type test and can be interpreted as the scaled ratio of the sum of squared forecast errors.

    Although Homm and Breitung (2012) adapted the KBTτ statistic to be applicable directly on log prices, we opt to use the statistic given by equation 15 to test stationarity of the returns in order not to rely on the forecast associated with the random walk assumption. Thus, under the alternative hypothesis, the return process switches at [τT] from a stationary process to a process with persistent returns. If the returns are persistent, the corresponding time series exhibits explosive behavior. When the breakpoint [τT] is unknown, the statistic can be expressed as

    supKBT(τ0)=supτ[τ0,1τ0]KBTτ (16)

    and rejects the null hypothesis of no bubble for large values.


    3.3. The Busetti-Taylor statistic

    A second test also adapted by Homm and Breitung (2012) for a bubble setup is the Busetti-Taylor statistic. The Busetti-Taylor statistic takes again the two cases 13 and 14 as the null and the alternative hypotheses respectively. When τ is known, the statistic corresponds to the locally best invariant test against changes in the order of integration under the assumption of Gaussianity. Namely, it has maximum local power close to the null hypothesis under invariant transformations.

    We deviate again from Homm and Breitung (2012) and proceed as with the KBT statistic by implementing directly the test as originally proposed in (Busetti and Taylor, 2004),

    BTτ=ˆσ2(T[τT])2Tt=[τT]+1(Tj=tˆϵj)2 (17)

    where ˆϵj are the OLS residuals from the regression of Δlnpt on an intercept, and ˆσ2=T1Tt=1ˆϵ2t. When the breakpoint is not known, the Busetti-Taylor statistic takes the following form

    supBT(τ0)=supτ[0,1τ0]BTτ (18)

    and, as in the previous statistics, rejects the null hypothesis of no bubble for large values.


    3.4. The Phillips statistic

    Phillips and Yu (2011) pointed out that, since g>0 in equation 2, if bt0, equation 2 implies that pt will also be explosive irrespective of whether dt is an integrated process or a stationary process. As a direct way to test for bubbles, they thus proposed to examine evidence of explosive behavior in pt via a time-varying AR model and an econometric procedure based on recursive Dickey-Fuller (DF) t-statistics. The time-varying AR model reads

    pt=ρpt1+ϵt , (19)

    where ϵt is a white noise process with E(ϵt)=0, E(ϵ2t)=σ2, and p0=c<. Under the null hypothesis, the process starts as a random walk (i.e. ρ=1) but under the alternative, ρ becomes larger than 1 and the process changes to an explosive process at an unknown time [τT]. Thus, the procedure consists of testing directly for explosive behavior via right-tailed unit root test for certain sub-periods of the data. Let ˆρt denote the OLS estimator of ρ and σρ,τ the usual estimator for the standard deviation of ˆρτ using the subsample {p1,...,p[τT]}. The forward recursive DF tests is given by,

    supDF(τ0)=supτ[τ0,1]DFτ (20)

    with DFτ=ˆρτ1ˆσρ,τ. The test rejects the null hypothesis of no bubble for large values of supDF(τ0)||.

    ||Phillips and Yu (2011) also developed an asymptotic distribution theory for mildly explosive processes in order to date the emergence or collapse of the bubbles. As we focus on bubble detection, we refrain from presenting that part of their contributions.


    4. Ex-ante identification and evaluation of bubbles


    4.1. Ex-ante identification strategy

    We apply the procedures described in section 3 to test for bubbles in the log of real house prices for every country in our sample. The supBT, supKBT, and supDF tests are applied on rolling windows with length of 60 quarters (15 years), correcting for serial correlation in the residuals, and setting τ0=0.2. These values for τ are not far from the 0.1 value typically used in other studies, and ensure that all of our estimations are based on at least 12 quarters (3 years). The super-exponential-trend tests employ a rolling window with a maximum length of 60 quarters, setting τ0=0.3. Both types of nonlinear trends (the LPPLS and the PL) are estimated by minimizing the sum of squared residuals using the procedure described in (Filimonov and Sornette, 2013). Given the relatively small sample size of our estimations, we use Monte Carlo simulations with 15,000 replications to calculate the critical values of all tests. Results of these simulations are shown in table 3.

    Table 3. Critical values for the different bubble tests. The reported values correspond to Monte Carlo simulations for samples of size 60 over 15000 repetitions. The * indicates that more than 90% of the estimations did not fulfill the LPPLS constraints, so critical values do not apply for this level.
    Level90%95%97.5%99%99.9%
    supLPPLS*2.193.24.276.67
    supPL1.662.182.593.0784.47
    supKBT11.5415.5820.3926.7551.5
    supBT1.171.461.762.113.00
    supDF1.491.912.322.895.52
     | Show Table
    DownLoad: CSV

    A bubble signal by test j is triggered at time t when the statistic is significant at a 5% α-level:

    BIjt={1p-valuejt<α and Δylnpt00otherwise (21)

    where Δylnpt>0 denotes the yearly log returns and are set to be positive to ensure that alarms are triggered only while prices are rising.

    Figure 1 presents the resulting bubble signals for the housing markets of Australia and the UK**. For Australia (panel 1b), timely bubble signals were triggered prior to 2004 in three cases (supDF, supPL, supLPPLS), while alarms not associated with bubbles were also detected around 1990 in all but one of the indicators (supLPPLS). For UK (panel 1b), three out of the five indicators (supBT, supPL, supLPPLS) timely identify the bubble prior to the 1990 tipping point while four detected a bubble prior to the 2005 tipping point (supDF being the exception); false alarms were triggered during the second half of the 90s in two of the indicators (supDF and supKBT).

    **Signals for the other countries are available in the supplementary material of the paper.

    Figure 1. Bubble indicators for Australia and the UK. The indicators, active in the shaded areas, are triggered when the corresponding statistic exceeds the 95% critical values. The solid line depicts (left y-axis) the country's real log house index on which the tests are applied. The dash lines (right y-axis) correspond to the calculated statistics. The vertical dashed yellow and red lines denote the tipping dates of the bubbles and financial crises respectively, according to tables 1 and 2. The solid red line corresponds to a year with a crisis and a bubble peak.

    4.2. Evaluation methodology

    Following Schularick and Taylor (2012), we analyze the relationship between the bubble indicators and our dependent variable using yearly logit regressions. We use the following specification:

    ybubblei,t=Λ(MAXLAGk=0βkBIji,tk1) (22)

    where i, k, j denote indices for the country, the time lag, and the bubble indicator respectively, Λ(˙) is the logit function, MAXLAG†† is set to 3, and ybubblei,t=1 if there was a tipping point at time t in country i (and 0 otherwise). The quality of the models is assessed via the significance of the Area under the Receiver Operating Characteristics Curve (AUROC). The ROC curve characterizes the quality of a forecast system by describing the system's ability to anticipate correctly the occurrence or non-occurrence of predefined events (Mason and Graham, 2002). The AUROC is increasing with the predictive power of the indicator and lies between 0 and 1.

    ††We explored with values between 2 and 5 and the results did not qualitatively differ.

    To obtain the ROC for a probabilistic forecast system, the probability at which a positive warning is issued is varied across a range of thresholds. For each threshold, the correspondence between the forecast and the observation is determined. This correspondence is described by a two component vector defined by the rate of true and false positives. Each pair of rates gives a coordinate in the ROC space that defines the ROC curve. It can be shown that when the forecast system has skill, the area under the curve will exceed 0.5 (0.5 being the case of an uninformative indicator). Furthermore, the significance of the ROC area can be objectively assessed via a rescaled Mann-Whitney U statistic, which tests the significance of forecast event probabilities for cases where events actually occur. For large samples, the significance of these measures can be assessed using a normal-distribution approximation:

    U=AUROCee2N(μ=ee2,σ2=ee(n+1)12) (23)

    where n is the total number of forecasts, e the total number of events (i.e. bubble tipping points), and e=ne the total number of non-events. The null hypothesis of no forecasting skill is rejected for large values of U. As discussed by Mason and Graham (2002), the errors of these approximations tend to be small for relatively large samples (a large number of forecasts).

    We study the AUROC obtained in-sample for the different bubble indicators. In addition, we study two types of out-of-sample performance: by country and by period. Out-of-sample by country calibrates the model multiple times, each time leaving one country out of the calibration in order to use it to assess the models' performance. Out-of-sample by period dynamically calibrates the models using data up to the tn period and tests the performance in year tn+1 for tn=1998...2012.


    5. Empirical results


    5.1. In-sample and out-of-sample performance

    Figure 2 summarizes the main results, also reported numerically in table 4. AUROCs are expressed in percentage terms, while the respective U-statistic appears in parentheses. As a reference, we have also included the performance of logit regressions, having the corresponding country price-to-income and price-to-rent ratios as dependent variables.

    Figure 2. Receiver operating characteristic (ROC) curves derived from bubble logit regressions (22). The independent variables are the bubble indicators derived from log prices (see equations (21) and (24)). The dependent variable yi, tbubble is set to 1 if a bubble peaked at time t in country i.
    Table 4. In-sample and out-of-sample AUROCs and AUROC statistics. Each column presents the AUROC (in percentage terms) and the AUROC statistic (in parentheses) for the denoted indicator. Out-of-sample leaving one country out corresponds to excluding once each country from the dataset, estimating the model, and evaluating the model's performance for the country out of the sample. Out-of-sample by period calibrates the model up to year t and evaluates the model's performance at year t+1 for t between 1998 and 2012. Significance levels are marked with *, **, and *** for 95%, 99%, and 99.9% respectively.
    Test:Single bubble indicators (based on log prices)HPI / IncomeHPI / Rent
    supLPPLSsupPLsupKBTsupBTsupDF
    In-sample 82.380
    (4.811)
    85.994
    (5.348)
    70.290
    (3.015)
    84.529
    (5.131)
    72.831
    (3.392)
    82.45
    (4.682)
    85.95
    (5.331)
    Out-of-sample by period 78.185
    (3.305)
    79.423
    (3.450)
    62.908
    (1.514)
    74.306
    (2.850)
    53.306
    (0.388)
    72.522
    (2.637)
    67.117
    (2.004)
    Out-of-sample by country 69.561
    (2.910)
    74.428
    (3.634)
    52.727
    (0.406)
    75.192
    (3.748)
    52.386
    (0.355)
    82.72
    (4.722)
    86.429
    (5.401)
    Single bubble indicators (based on log price-to-income)
    In-sample 74.306
    (3.608)
    78.536
    (4.236)
    60.353
    (1.537)
    73.776
    (3.529)
    53.481
    (0.517)
    Out-of-sample by period 64.231
    (1.667)
    64.615
    (1.712)
    65.641
    (1.832)
    69.455
    (2.278)
    58.109
    (0.950)
    Out-of-sample by country 58.868
    (1.318)
    64.203
    (2.111)
    49.123
    (0.130)
    60.988
    (1.633)
    33.707
    (2.422)
    Single bubble indicators (based on log price-to-rent)
    In-sample 82.867
    (4.879)
    79.208
    (4.336)
    60.138
    (1.505)
    74.210
    (3.594)
    59.265
    (1.375)
    Out-of-sample by period 77.484
    (3.219)
    63.734
    (1.608)
    36.875
    (1.537)
    54.936
    (0.578)
    37.019
    (1.520)
    Out-of-sample by country 63.641
    (2.028)
    63.028
    (1.937)
    36.057
    (2.073)
    61.093
    (1.649)
    35.373
    (2.174)
    Combined bubble indicators (based on log prices)
    Threshold:12345
    In-sample 89.177
    (5.821)
    88.763
    (5.760)
    87.747
    (5.609)
    66.095
    (2.392)
    58.804
    (1.308)
    Out-of-sample by period 77.249
    (3.196)
    84.617
    (4.059)
    74.245
    (2.843)
    51.495
    (0.175)
    46.633
    (0.395)
    Out-of-sample by country 71.967
    (3.268)
    76.822
    (3.991)
    75.520
    (3.797)
    47.899
    (0.313)
    38.900
    (1.651)
     | Show Table
    DownLoad: CSV

    The in-sample AUROC of the different indicators is large with high statistical significance. The first row of table 4 shows that all AUROCs are above 0.7, and significant at the 99% level. The supPL test exhibits the highest performance, followed by the supBT and supLPPLS tests. The performance of the other two indicators, supKBT and supDF, is somewhat lower but still significantly positive. In-sample results are thus encouraging and suggest that the indicators contain relevant information associated to the end of the bubble. In order to test whether they translate to out-of-sample performance, we move now to the out-of-sample analysis. In 3 out of 5 cases, the values are comparable to those obtained when using the price-to-income and price-to-rent ratios, which exhibit high AUROC values, in line with the positive results for these quantities reported in the literature (Bourassa et al., 2016).

    Panels 2c and 2e show the results for the out-of-sample by period and by country tests respectively. Not surprisingly, the performance of all regressions drops visibly, from an average of 0.79 to an average of 0.69 (out-of-sample by period) and 0.65 (one-country-out). Nevertheless, three of the indicators contain significant predictive power in the out-of-sample by period test, exhibiting AUROC close to or above 0.75. The supLP test achieves the highest predictive power, followed by the supLPPLS (2nd) and the supBT tests (3rd). The performance of these cases is even higher that those of the price-to-income and price-to-rent ratios, respectively at 0.72 and 0.67, significant at the 99% level.

    As for the setup consisting of leaving one country out, the results are somewhat less favorable. On one hand, The AUROCs based on the supBT, the supPL, the supLPPLS tests are still significant, but the significance of the latter dropped, while supBT, showing the best performance, is still significant at the 99.9% level. On the other hand, the AUROCs of the indicators based on the supKBT and the supDF tests have decreased more (16.1% and 1.7% respectively). The performance of the indicators have thus decreased, evidencing the indicators' difficulties in transferring the learnings from a set of countries to an unseen market. This contrasts with the price-to-income and price-to-rent ratios, which reached AUROC values above the 0.8 level, perhaps because they are better able to exploit the correlation between OECD house prices observed during the last global real estate cycle.

    The differences between the results of the tests might be explained in several ways. The fact that the supBT statistic dominated the supKBT statistic, despite the fact that they assume the same data generating process, highlights the impact that a test's statistical power has on the bubble detection. The observed lacking forecasting skill of the supDF statistic, especially compared to those reported by Homm and Breitung (2012), might stem from our use of sliding windows of a maximum of 60 quarters to ensure that any alarm corresponds to a current bubble episode‡‡. The satisfactory results of the supLPPLS and the supPL tests might be revealing key features of bubbles, since they presuppose bubbles as fundamentally different market regimes, driven by slow-maturation and transient non-linear dynamics with a necessary end associated with the finite-time singularity. Other readings are possible, but we decided not to inquire further, as surely our results fall short of disentangling the origin of the variations in the results of the tests. Admittedly, the bubble analysis that we have conducted face a joint-hypothesis problem, as the differences among the tests might arise either because of the theory on which the statistics are based or because of their statistical power.

    ‡‡Homm and Breitung (2012) employed the whole sample to date the bubbles, which is difficult in an ex-ante setup. Although they also proposed the CUSUM and FLUC tests to monitor time series, these statistics require a bubble-free period as a training sample, which would introduce further complications and make difficult a comparison between the tests.

    Nevertheless, beyond the specific results of each of the indicators, we argue that the emerging picture is very positive. The bubble tests have forecasting skill associated to the end of the bubble, and they can complement the ex-ante identification based on the HP filter and similar approaches, often used in the literature. As the bubble tests rely on very few parameters inferred from the past, we believe they are likely to be more robust, and therefore more appropriate, to ex-ante study future housing boom episodes. Moreover, the fact that the supLPPLS and supPL contain significant value relative to the other approaches highlights the need to further employ diagnosis that are sensitive to the super-exponential growth pattern, which is increasingly supported as being a key stylized facts of financial and housing bubbles (Hüsler et al., 2013; Leiss et al., 2015; Gjerstad and Smith, 2014).


    5.2. Assessment based on Price-to-Income and Price-to-Rent ratios

    A common question regarding the practical application of the statistical tests that we have analyzed is whether they should be applied directly on prices, or on some measure intended to reflect the misalignment of prices with the fundamental value, such as price-to-income and price-to-rent ratios. In this section, we seek to shed some light on this issue by analyzing the performance of the analogous bubbles indicators based on these two ratios. Figure 3 summarizes the main results, also reported numerically in panels 2 and 3 of table 4.

    Figure 3. Receiver operating characteristic (ROC) curves derived from bubble logit regressions (22). The independent variables are the bubble indicators derived from log priceto-income and price-to-rent ratios (see equations (21) and (24)). The dependent variable yi, tbubble is set to 1 if a bubble peaked at time t in country i.

    The performance of the ratio-based indicators drops substantially relative to that of the price-based indicators. In-sample, average AUROCs values decrease from 0.78 to 0.68 and 0.71 for the price-to-income and price-to-rent ratios respectively. Out-of-sample by period, the corresponding average values fall from 0.69 to 0.64 and 0.54. Out-of-sample by country, the average AUROCs values plunge to the relatively low levels of 0.53 and 0.51 for the price-to-income and price-to-rent ratios respectively.

    The significance of the AUROC statistics also deteriorates. For both ratios, two of the indicators (supKBT and supDF) are not in-sample significant anymore; in contrast to the highly in-sample significance levels observed by the use of price levels. Only the supLPPLS-based indicator yields a significant out-of-sample by period AUROC value when derived from the price-to-rent ratios, while the significance level of all the indicators decreases in the equivalent quantities derived from the price-to-income ratios. Similarly, in the out-of-sample by country set-up, there is only one indicator derived from price-to-income ratios with significant results (supPL), and the significance level of the three significant indicators derived from price-to-rent ratios is lower than the equivalent quantities derived from prices. As we described in the last section, these patterns contrast sharply with the performances of the regressions based directly on either of the indicators, which are relatively high and significant.

    Lastly, in figure 3 we observe that the maximum attainable rate of true positive decreases out-of-sample for most of the indicators, which differs from the perfect true positive rate generally attainable by the indicators derived from price levels. In the out-of-sample by country set-up, none of the indicators reaches a perfect rate of true positives, and only the supLPPLS indicator achieves the maximum rate in the out-of-sample by period set-up, when derived from the log price to rent ratio. In other words, the bubble tests when applied on ratios, have not exhibited sufficient power to ex-ante reject the null hypothesis of no bubble. As a consequence, there is no path for the corresponding indicators to timely trigger the bubble signal, for any admissible false positive rate.

    In our view, the lower performance yielded by the application of the bubble tests on these two ratios points to the endogenous nature of housing bubbles, which often co-occur as part of a more broader macroeconomic phenomenon. To the extent that rent, disposable income, and possibly other macroeconomic factors might also be endogenous to the bubble regime - as it is arguably the case of housing and credit booms coinciding with economic booms -, ratio-based tests might lack the statistical power to detect the bubble episode. Prices might be considered in line with fundamentals, because the so-called fundamental factors might also be going through an unstable phase.


    5.3. Combination of the indicators

    We now explore whether there are gains in combining the different bubble indicators (BI). We create a composite indicator based on the number of simultaneous alarms at a given time period. Namely:

    ICt(k)={15j=1BIj,tk0otherwise (24)

    where j denotes one of the five indicators and k corresponds to the necessary threshold to trigger the composite bubble signal. Panels 2b, 2f and 2d of figure 2, and the second part of table 4 present the results.

    We make two observations. First, combining the single indicators seems to yield more robust performance. Excluding the results for k=4,5, which correspond anyhow to very high thresholds, the in-sample and out-of-sample AUROCs are significant and above 0.87 (in-sample) and 0.72 (out-of-sample). Second, these results are positive enough to improve over the single top performing indicator (supPL or suptBT depending on the setup). The in-sample performance of the composite indicator is notably higher than the single indicator and the out-of-sample results are up to 6.5 percentage points higher. These two observations together suggest that, in the presence of model uncertainty, combining the indicators (i.e. using multiple tests for bubbles in a more general set-up) is a sensible decision. The resulting gains might derive not only in greater performance, but also in increasing robustness. Our results are thus consistent with the rather large literature, both in economics and more generally in pattern recognition, on the improved performance of compound predictors (Kittler et al., 1998; Timmermann, 2006). The gains are remarkable in light of the simplicity of the rule that we have employed.


    5.4. Bubbles and systemic financial crises

    We have already presented supporting evidence that the bubble tests contain significant information to identify the end of the bubble period. We now contrast these results with those obtained by applying the same tests to forecast systemic financial crises. The goal is to further shed light on the phenomenon that the bubble tests are capturing. We use the regression

    ycrisisi,t=Λ(MAXLAGk=0βkBIji,tk1) (25)

    where i, k, j denotes indices for the country, the time lag, and the bubble indicator respectively, Λ() is the logit function, MAXLAG is set to 3, and ycrisisi,t=1 if a crisis started at time t in country i (and 0 otherwise). As before, we discuss the AUROCs and AUROC statistics to analyze the indicators' performance.

    Figure 4 and table 5 summarize the results for the single and combined indicators. The differences in performance are appreciable. In general, only the AUROC of in-sample regressions are statistically significant. The performance of out-of-sample AUROCs of single and combined indicators drops visibly and no single indicator yields significant results. Among the out-of-sample tests, the maximum achieved AUROC slightly exceeds the 0.65 value (combined BI, with threshold 2) against the 0.77 value of the equivalent quantity reached by the bubble logit regressions of the previous section.

    Figure 4. Receiver operating characteristic (ROC) curves derived from crisis logit regressions (25). The independent variables are the bubble indicators derived from log prices (see equations (21) and (24)). The dependent variable yi, tcrisis is set to 1 if a crisis started at time t in country i.
    Table 5. In-sample and out-of-sample AUROCs and AUROC statistics. Each column presents the AUROC (in percentage terms) and the AUROC statistic (in parentheses) for the denoted indicator. Out-of-sample leaving one country out corresponds to excluding once each country from the dataset, estimating the model, and evaluating the model's performance for the country out of the sample. Out-of-sample by period calibrates the model up to year t and evaluates the model's performance at year t+1 for t between 1998 and 2012. Significance levels are marked with *, **, and *** for 95%, 99%, and 99.9% respectively.
    Test:Single bubble indicators (based on log prices)HPI / IncomeHPI / Rent
    supLPPLSsupPLsupKBTsupBTsupDF
    In-sample82.380
    (4.811)
    85.994
    (5.348)
    70.290
    (3.015)
    84.529
    (5.131)
    72.831
    (3.392)
    82.45
    (4.682)
    85.95
    (5.331)
    Out-of-sample by period78.185
    (3.305)
    79.423
    (3.450)
    62.908
    (1.514)
    74.306
    (2.850)
    53.306
    (0.388)
    72.522
    (2.637)
    67.117
    (2.004)
    Out-of-sample by country69.561
    (2.910)
    74.428
    (3.634)
    52.727
    (0.406)
    75.192
    (3.748)
    52.386
    (0.355)
    82.72
    (4.722)
    86.429
    (5.401)
    Single bubble indicators (based on log price-to-income)
    In-sample74.306
    (3.608)
    78.536
    (4.236)
    60.353
    (1.537)
    73.776
    (3.529)
    53.481
    (0.517)
    Out-of-sample by period64.231
    (1.667)
    64.615
    (1.712)
    65.641
    (1.832)
    69.455
    (2.278)
    58.109
    (0.950)
    Out-of-sample by country58.868
    (1.318)
    64.203
    (2.111)
    49.123
    (0.130)
    60.988
    (1.633)
    33.707
    (2.422)
    Single bubble indicators (based on log price-to-rent)
    In-sample82.867
    (4.879)
    79.208
    (4.336)
    60.138
    (1.505)
    74.210
    (3.594)
    59.265
    (1.375)
    Out-of-sample by period77.484
    (3.219)
    63.734
    (1.608)
    36.875
    (1.537)
    54.936
    (0.578)
    37.019
    (1.520)
    Out-of-sample by country63.641
    (2.028)
    63.028
    (1.937)
    36.057
    (2.073)
    61.093
    (1.649)
    35.373
    (2.174)
    Combined bubble indicators (based on log prices)
    Threshold:12345
    In-sample 89.177
    (5.821)
    88.763
    (5.760)
    87.747
    (5.609)
    66.095
    (2.392)
    58.804
    (1.308)
    Out-of-sample by period 77.249
    (3.196)
    84.617
    (4.059)
    74.245
    (2.843)
    51.495
    (0.175)
    46.633
    (0.395)
    Out-of-sample by country 71.967
    (3.268)
    76.822
    (3.991)
    75.520
    (3.797)
    47.899
    (0.313)
    38.900
    (1.651)
     | Show Table
    DownLoad: CSV

    We interpret these results as clear evidence that bubbles and crises are different phenomena. This is not to deny the strong relationship that may exist among them, but to emphasize that they should be treated and analyzed differently. In addition, the poor performance of the indicators to predict crises may be interpreted as a sanity check for our results. The indicators have been able to capture well the events for which they are intended, and have underperformed when used to forecast a related, but different, type of event.


    5.5. Robustness checks

    Finally, we assess the sensitivity of our results to the length of the rolling window employed by the bubble tests. In stationary time series, a longer a window length should increase the power of the tests. However, in non-stationary time series, such as real estate prices, exhibiting commonly several structural breaks, a longer sample can complicate the bubble identification, as the price dynamics can change with changing market conditions. Intuitively, the tests might be unable to distinguish between past and current bubble periods, or label recoveries followed by protracted busts, as new exuberance periods.

    Concretely, using rolling window lengths varying between 40 and 80 quarters, with 5 quarters of increment (for a total of 9 different values), we re-calculated the bubble indicators and re-evaluated their performance for the in-sample and out-of-sample set-ups. Figure 5 presents the AUROC values of the individual and compounded indicators.

    Figure 5. Robustness checks to the length of the rolling window used by the bubble tests. Each plot shows the area under the receiver operating characteristic curves AUROCs of the denoted set of bubble indicators, when varying the window length.

    The supLPPLS, supPL, and supBT-based indicators present relatively low sensitivity to the window length. In-sample (panel 5a), the respective AUROCs remain above 0.8. Out-of-sample by period (panel 5c) and by country (panel 5e), AUROC values of the supLPPLS and supPL indicators vary only slightly, while supBT's performance peaks in the 50-70 length range and only deteriorates mildly (less than 5 percent relative to the peak) as the length becomes much shorter or larger. On the contrary, the supDF and the supKBT statistics do present substantial sensitivity to the window length. In-sample and out-of-sample, there is a visible negative and rather volatile trend as the window length increases. Their AUROC values peak at a length of 50 quarters, while bottom bellow 0.4 in the 65-70 range. Overall, these results are consistent with the relative performance of the indicators discussed in section 5.1.

    Also consistent with the previous discussions, combining the indicators generates higher AUROC values relative to the single indicators regardless of the window length. Using 1 or 2 indicators to trigger the bubble signal yields AUROC values close to 0.9 in-sample (panel 5b), and above 0.75 out-of sample (panels 5d and 5f), for most of the window lengths explored. Using 3 indicators yields somewhat more volatile results, though they remain fairly consistent in the 50-65 quarters range. Lastly, combining 4 and 5 indicators clearly leads to under-performing and highly volatile values. The latter should not be surprising, as two of the indicators (those based on the supDF and supKBT tests) yield results very sensitive to the window length.


    6. Conclusions

    This paper analyzed the information content of statistical tests for bubble detection in the context of international real estate markets. We derived binary indicators from the causal application of selected statistical tests to log house prices, and via logit regressions we assessed the indicators' in-sample and out-of-sample performance in the forecasting of tipping points of housing bubbles and systemic financial crises. We have argued that our benchmark is more robust and objective than any benchmark relying on the ex-post identification of bubble episodes using a technical rule sizing price expansions and corrections.

    Overall, our results suggest that the tests contain significant ex-ante information related to the end of the bubble period (with supPL and supBT being the tests that performed best), while combining the indicators yielded the highest performance. Robustness checks that vary the rolling window length employed by the bubble tests further support these conclusions. In addition, the application of the tests to price-to-income and rent-to-income ratios led to a substantial deterioration in the performance of the indicators, which indicates that it should be discouraged.

    The patterns that we have documented contribute to the literature in several ways. First, they have a direct impact on the literature on early warning signals and financial stress indices, à la (Misina et al., 2009) and (Christensen and Li, 2014), as bubble-tests derived indicators might be integrated into broader forecast systems. Second, they suggest that bubble tests based on price dynamics, and specially those based on the super-exponential growth pattern, constitute sound tools to identify bubble regimes and study housing bubble episodes. Finally, our results also stress the need for understanding the theoretical underpinnings behind explosiveness in price dynamics, as well as the consequences of such phenomena.


    Acknowledgments

    We are very grateful to the anonymous reviewers for valuable communications. We would like to acknowledge support for this project from the SNF, under Grant 160305.


    Conflict of interest

    The authors declare no conflict of interest.




    [1] UN. 2018. 68% of the World Population Projected to Live in Urban Areas by 2050, Says UN. UN DESA Department of Economic and Social Affairs. United Nations. United Nations. https://www.un.org/development/desa/en/news/population/2018-revision-of-world-urbanization-prospects.html.
    [2] Knorr D, Khoo CSH, Augustin M A (2017) Food for an urban planet: challenges and research opportunities. Front Nutr 4: 73.
    [3] Romeo D, Vea EB, Thomsen M (2018) Environmental impacts of urban hydroponics in Europe: a case study in Lyon. Procedia CIRP 69: 540-545.
    [4] UNEP. 2002. Vital Water Graphics: An Overview of the State of the World's Fresh and Marine Waters. UNEP. https://www.unenvironment.org/resources/report/vital-water-graphics-overview-state-worlds-fresh-and-marine-waters.
    [5] Stephen L (2018) 75% of Earth's land areas are degraded. National Geographic website. Available from: https://news.nationalgeographic.com/2018/03/ipbes-land-degradation-environmental-damage-report-spd/
    [6] Miller G, Spoolman S (2014) Chapter 10 Food Production and the environment. Environmental Science, 15th Ed. Cengage Learning: Boston, MA.
    [7] Gunders D, Bloom J (2017) Wasted: how America is losing up to 40 percent of its food from farm to fork to landfill. NRDC, R: I7-05-A.
    [8] USEPA Website a. Overview of Greenhouse Gases. Methane Emissions. USEPA, Washington DC. Available from: https://www.epa.gov/ghgemissions/overview-greenhouse-gases
    [9] Saer A, Lansing S, Davitt NH, et al. (2013) Life cycle assessment of a food waste composting system: environmental impact hotspots. J Clean Prod 52: 234-244.
    [10] USEPA (2009) Backyard Composting It's only natural. USEPA, Washington DC.
    [11] Warman PR (2005) Soil fertility, yield and nutrient contents of vegetable crops after 12 years of compost or fertilizer amendments. Biol Agric Hortic 23: 85-96.
    [12] Iqbal S, Thierfelder C, Khan HZ, et al. (2017) Maximizing maize quality, productivity and profitability through a combined use of compost and nitrogen fertilizer in a semi-arid environment in Pakistan. Nutr Cycl Aagroecosys 107: 197-213
    [13] Hepperly P, Lotter D, Ulsh CZ, et al. (2009) Compost, manure and synthetic fertilizer influences crop yields, soil properties, nitrate leaching and crop nutrient content. Compost Sci Util 17: 117-126.
    [14] Kumar B, Kumar S, Prakash D, et al. (2011) A study on sugar mill pressmud compost for some heavy metal content and their bio-availability. Asian J Plant Sci Res 1: 115-122
    [15] Mikkelsen R, Hartz TK (2008) Nitrogen sources for organic crop production. Better Crop 92: 16-19.
    [16] Zhang Y, Li C, Wang Y, et al. (2016) Maize yield and soil fertility with combined use of compost and inorganic fertilizers on a calcareous soil on the North China Plain. Soil Till Res 155: 85-94.
    [17] Sarangi BK, Mudliar SN, Bhatt P, et al. (2008) Compost from Sugar mill press mud and distillery spent wash for sustainable agriculture. Dyn Soil Dyn Plant 2: 35-49.
    [18] Farhad W, Cheema MA, Saleem MF, et al. (2011) Response of Maize Hybrids to Composted and Non-composted Poultry Manure under Different Irrigation Regimes. Int J Agric Biol 13.
    [19] Brown S, Cotton M (2011)Changes in soil properties and carbon content following compost application: results of on-farm sampling. Compost Sci Util 19: 87-96.
    [20] Stewart-Wade SM (2020)Efficacy of organic amendments used in containerized plant production: Part 1-Compost-based amendments. Sci Hortic 266: 108856.
    [21] Bonanomi G, D'Ascoli R, Scotti R, et al. (2014) Soil quality recovery and crop yield enhancement by combined application of compost and wood to vegetables grown under plastic tunnels. Agr Ecosyst Environ 192: 1-7.
    [22] Evanylo G, Sherony C, Spargo J, et al. (2008) Soil and water environmental effects of fertilizer-, manure-, and compost-based fertility practices in an organic vegetable cropping system. Agr Ecosyst Environ 127: 50-58.
    [23] Mö ller K (2018) Soil fertility status and nutrient input-output flows of specialised organic cropping systems: a review. Nutr Cycl Aagroecosys 112: 147-164.
    [24] Mu D, Horowitz N, Casey M, et al. (2017) Environmental and economic analysis of an in-vessel food waste composting system at Kean University in the US. Waste Manage 59: 476-486.
    [25] Eksi M, Rowe DB, Fernández-Cañ ero R, et al. (2015) Effect of substrate compost percentage on green roof vegetable production. Urban For Urban Gree 14: 315-322.
    [26] Tavarini S, Cardelli R, Saviozzi A, et al. (2011) Effects of green compost on soil biochemical characteristics and nutritive quality of leafy vegetables. Compost Sci Util 19: 114-122.
    [27] Leghari SJ, Wahocho NA, Laghari GM, et al. (2016) Role of nitrogen for plant growth and development: A review. Adv Environ Biol 10: 209-219
    [28] López-Cuadrado MC, Ruiz-Fernández J, Masaguer A, et al. (2005) Utilization of different organic wastes from madrid as growth media for Pelargonium zonale[C]//International Symposium on Growing Media. 779: 623-630.
    [29] Jiang Y, Li Y, Nie G, et al. (2016) Leaf and root growth, carbon and nitrogen contents, and gene expression of perennial ryegrass to different nitrogen supplies. J Am Soc Hortic Sci 141: 555-562.
    [30] Cao Y, Huang H, Wang J, et al. (2020) Crop response and quality of soil as affected by hyperthermophilic compost in Tai-Lake region of China. J Plant Nutr 43: 1000-1015.
    [31] Ceglie FG, Elshafie H, Verrastro V, et al. (2011) Evaluation of olive pomace and green waste composts as peat substitutes for organic tomato seedling production. Compost Sci Util 19: 293-300.
    [32] Mugnai S, Pasquini T, Azzarello E, et al. (2007) Evaluation of composted green waste in ornamental container-grown plants: effects on growth and plant water relations. Compost Sci Util 15: 283-287.
    [33] Barrett GE, Alexander PD, Robinson JS, et al. (2016) Achieving environmentally sustainable growing media for soilless plant cultivation systems-A review. Sci Hortic 212: 220-234.
    [34] Gong X, Li S, Sun X, et al. (2018) Green waste compost and vermicompost as peat substitutes in growing media for geranium (Pelargonium zonale L.) and calendula (Calendula officinalis L.). Sci Hortic 236: 186-191.
    [35] Massa D, Malorgio F, Lazzereschi S, et al. (2018) Evaluation of two green composts for peat substitution in geranium (Pelargonium zonale L.) cultivation: Effect on plant growth, quality, nutrition, and photosynthesis. Sci Hortic 228: 213-221.
    [36] Kranz CN, McLaughlin RA, Johnson A, et al. (2020) The effects of compost incorporation on soil physical properties in urban soils-A concise review. J Environ Man 261: 110209.
    [37] Faucette LB, Jordan CF, Risse LM, et al. (2005) Evaluation of stormwater from compost and conventional erosion control practices in construction activities. J Soil Water Conserv 60: 288-297.
    [38] Logsdon SD, Sauer PA, Shipitalo MJ (2017) Compost improves urban soil and water quality. J Water Res Prot 2017: 345-357.
    [39] Mohammadshirazi F, McLaughlin R A, Heitman J L, et al. (2017) A multi-year study of tillage and amendment effects on compacted soils. J Environ Man 203: 533-541.
  • This article has been cited by:

    1. Rashad A. Abdel-Baky, Fatemah Mofarreh, Francesco Tornabene, On an Explicit Characterization of Spherical Curves in Dual Lorentzian 3-SpaceD13, 2022, 2022, 1563-5147, 1, 10.1155/2022/3044305
    2. Nural Yüksel, Burçin Saltık, On inextensible ruled surfaces generated via a curve derived from a curve with constant torsion, 2023, 8, 2473-6988, 11312, 10.3934/math.2023573
    3. Marija S. Najdanović, Characterization of dual curves using the theory of infinitesimal bending, 2024, 47, 0170-4214, 8626, 10.1002/mma.10035
  • Reader Comments
  • © 2020 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(4551) PDF downloads(181) Cited by(4)

Article outline

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog