Research article Special Issues

The informative value of central banks talks: a topic model application to sentiment analysis

  • Central banks communication has lately become an important tool to guide expectations and its impact on the economy has been acknowledged by the literature. Nowadays central banks speeches face an increasing variety of topics, which are not discriminated by text analysis. In this paper we build a topic-weighted central bank sentiment index as a combination of machine learning and text analysis techniques to investigate large datasets. First, we develop a methodological framework to grid search the best Latent Dirichlet Allocation (LDA) model to uncover the latent topics in central banks' speeches and releases published between 2000 and 2021. Then, we build a topic-specific sentiment index based on dictionary techniques. Next, we summarise the results in a topic-weighted Central Bank Sentiment Index (CBSIw) for the Bank of Canada (BoC), the Bank of England (BoE), the European Central Bank (ECB) and the Federal Reserve (Fed). We find that the main common driver of the CBSIw is the monetary policy topic, followed by macroprudential policy and payments and settlements. We also uncover bank-specific topics and topics related to new challenges, for example innovation and climate change. Moreover, we find that the CBSIw decreases after the Great Recession, signalling a worsening in sentiment, as well as during the COVID-19 crisis. Finally, we employ a probit regression to further assess the predictive power of our monetary policy topic-specific index. We find that the indicator helps predicting future changes in policy rate, corroborating the evidence that central banks communication signals future monetary policy decisions.

    Citation: Maria Paola Priola, Annalisa Molino, Giacomo Tizzanini, Lea Zicchino. The informative value of central banks talks: a topic model application to sentiment analysis[J]. Data Science in Finance and Economics, 2022, 2(3): 181-204. doi: 10.3934/DSFE.2022009

    Related Papers:

    [1] Changjun Zheng, Md Abdul Mannan Khan, Mohammad Morshedur Rahman, Shahed Bin Sadeque, Rabiul Islam . The impact of monetary policy on banks' risk-taking behavior in an emerging economy: The role of Basel II. Data Science in Finance and Economics, 2023, 3(4): 427-451. doi: 10.3934/DSFE.2023024
    [2] Changjun Zheng, Md Mohiuddin Chowdhury, Anupam Das Gupta . Investigating the influence of ownership on the relationship between bank capital and the cost of financial intermediation. Data Science in Finance and Economics, 2024, 4(3): 388-421. doi: 10.3934/DSFE.2024017
    [3] Dominic Joseph . Estimating credit default probabilities using stochastic optimisation. Data Science in Finance and Economics, 2021, 1(3): 253-271. doi: 10.3934/DSFE.2021014
    [4] Angelica Mcwera, Jules Clement Mba . Predicting stock market direction in South African banking sector using ensemble machine learning techniques. Data Science in Finance and Economics, 2023, 3(4): 401-426. doi: 10.3934/DSFE.2023023
    [5] Mesbaul Haque Sazu, Sakila Akter Jahan . Impact of blockchain-enabled analytics as a tool to revolutionize the banking industry. Data Science in Finance and Economics, 2022, 2(3): 275-293. doi: 10.3934/DSFE.2022014
    [6] Zimei Huang, Zhenghui Li . What reflects investor sentiment? Empirical evidence from China. Data Science in Finance and Economics, 2021, 1(3): 235-252. doi: 10.3934/DSFE.2021013
    [7] Sina Gholami, Erfan Zarafshan, Reza Sheikh, Shib Sankar Sana . Using deep learning to enhance business intelligence in organizational management. Data Science in Finance and Economics, 2023, 3(4): 337-353. doi: 10.3934/DSFE.2023020
    [8] Moses Khumalo, Hopolang Mashele, Modisane Seitshiro . Quantification of the stock market value at risk by using FIAPARCH, HYGARCH and FIGARCH models. Data Science in Finance and Economics, 2023, 3(4): 380-400. doi: 10.3934/DSFE.2023022
    [9] Yen H. Hoang, Duong T.T. Nguyen, Linh H.T. Tran, Nhung T.H. Nguyen, Ngoc B. Vu . Customers' adoption of financial services offered by banks and fintechs partnerships: evidence of a transitional economy. Data Science in Finance and Economics, 2021, 1(1): 77-95. doi: 10.3934/DSFE.2021005
    [10] Ying Li, Keyue Yan . Prediction of bank credit customers churn based on machine learning and interpretability analysis. Data Science in Finance and Economics, 2025, 5(1): 19-34. doi: 10.3934/DSFE.2025002
  • Central banks communication has lately become an important tool to guide expectations and its impact on the economy has been acknowledged by the literature. Nowadays central banks speeches face an increasing variety of topics, which are not discriminated by text analysis. In this paper we build a topic-weighted central bank sentiment index as a combination of machine learning and text analysis techniques to investigate large datasets. First, we develop a methodological framework to grid search the best Latent Dirichlet Allocation (LDA) model to uncover the latent topics in central banks' speeches and releases published between 2000 and 2021. Then, we build a topic-specific sentiment index based on dictionary techniques. Next, we summarise the results in a topic-weighted Central Bank Sentiment Index (CBSIw) for the Bank of Canada (BoC), the Bank of England (BoE), the European Central Bank (ECB) and the Federal Reserve (Fed). We find that the main common driver of the CBSIw is the monetary policy topic, followed by macroprudential policy and payments and settlements. We also uncover bank-specific topics and topics related to new challenges, for example innovation and climate change. Moreover, we find that the CBSIw decreases after the Great Recession, signalling a worsening in sentiment, as well as during the COVID-19 crisis. Finally, we employ a probit regression to further assess the predictive power of our monetary policy topic-specific index. We find that the indicator helps predicting future changes in policy rate, corroborating the evidence that central banks communication signals future monetary policy decisions.



    Over the last decades central banks have increasingly relied on communication as a tool for monetary policy and, since the financial crisis, there has been a trend towards an increased level of transparency. Information has been disclosed in official documents, during broadcast interviews and on social networks. The impact of this effort has been widely acknowledged in the literature. Similarly, there is an increasing interest of policymakers and market operators in the economic "sentiment" expressed by central banks in their communication. For this reason, a flourish of literature on text analysis applied to central banking has emerged in recent years, since with this technique it is possible to obtain a metric of the qualitative information embedded in communication (Loughran and McDonald, 2011; Picault and Renault, 2017; Correa et al., 2017). The sentiment indicators so obtained are used to investigate the effects of central banks' communication on macroeconomic and financial variables or to show interconnections among central banks' policies (Armelius et al., 2020; Picault and Renault, 2017). Moreover, these indicators work as a proxy of survey-based indices on economic sentiment (Shapiro et al., 2020). However, one drawback of the lexicon-based technique with large datasets is that sentiment is created without extrapolating the context in which the words are used. Therefore, it provides a sentiment metric of a given speech regardless of the subject.

    In this paper we contribute to the literature on central banks communication along several dimensions. We merge topic modelling and text analysis techniques to create a topic weighted Central Bank Sentiment Index (weighted CBSI or CBSIw) for the Bank of Canada (BoC), the Bank of England (BoE), the European Central Bank (ECB) and the Federal Reserve (Fed). To start with, we apply Latent Dirichlet Allocation (LDA), introduced by Blei et al. (2003) to uncover the latent topics in central banks communication and investigate their evolution over time. Given that LDA requires a priori assumptions on the model setup and there is no guidance for the optimal hyperparameters choice, we develop a procedure based on the grid search method. Next, we build a topic sentiment index with a lexicon methodology in line with Priola et al. (2021) to monitor the sentiment relative to each learned topic. Then, we combine the indexes to obtain the CBSIw that informs us not only about the economic sentiment of central banks' communication, but also it informs us about the relative importance of sub-topics. Furthermore, the sub-indexes can be monitored to investigate the sentiment on specific arguments, e.g. the climate change. The indicator is calculated as weighted monthly average of the topics, where the weights are given by the number of speeches on each topic, each month. Finally, we examine the monetary policy predictability by employing a probit model to regress our monetary policy sentiment indicator on future policy rate decision. Our main findings are the following. First, the main common driver of the weighted CBSI is the monetary policy topic, followed by macroprudential policy and payments and settlements. There are only a few topics that are not relevant for all four central banks. This is the case of the promotion of consumer protection for the Fed and the need of financial integration for the ECB, while international affairs is more relevant for the BoC. Second, the weighted CBSI decreases for all banks after the Great Recession, signalling a worsening in sentiment, and drops during the pandemic crisis. Third, probit estimation shows that the monetary policy topic-indicator is positive and significant for the all Central Banks with the exception of Bank of Canada, giving support to the claim that the monetary policy communication helps predicting future changes in policy rates.

    The rest of the paper is organised as follows. Section 2 reviews the empirical literature. Section 3 describes the collection of documents that were analysed and provides descriptive statistics about the panel of Central Banks. Section 4 discusses the methodology to identify the topics and build the weighted CBSI. Section 5 presents the LDA clustering results and Section 6 provides evidence on the predictive power of our sentiment indicator. Section 7 concludes.

    Our paper mainly contributes to the literature on central banks communication and text analysis, or Natural Language Processing (NLP). Text analysis techniques allow to automatically organise, analyse and summarise large quantities of information. Furthermore, they have the advantage to eliminate the subjectivity associated with the human reading approach. The related literature is growing together with the increased amount of central bank communication, which make reading through all the documents challenging.

    One approach is to use dictionary-based techniques following Loughran and McDonald (2011). Priola et al. (2021) construct their own Central Bank Sentiment Indexes (CBSIs) applying text analysis to a large dataset of speeches and releases published by for the Federal Reserve (Fed), the European Central Bank (ECB), the Bank of England (BoE) and the Bank of Japan (BoJ) between 2000 and 2020. This method consists in automatically matching a set of words with a pre-given connotation (a dictionary or lexicon), to words in the text of interest. After having determined the frequency of words with their particular connotation, the literature suggests several approaches to build an indicator. The authors proposed the Net Score Index, similar to the approach developed by Birz and Lott Jr (2011) to build their CBSIs.

    An appealing feature of dictionary-based techniques is their simplicity. However, a major limitation of text analysis is that it does not capture the context in which the words are used. Other approaches, such as topic modelling, address this issue. A topic model adopts a flexible statistical structure that clusters words together to form "topics", allowing the same word to belong to more than one topic. Such approach is used to uncover the hidden semantic structure in a text. A common approach is the Latent Dirichlet Allocation (LDA), introduced by Blei et al. (2003). This algorithm has been widely used in social and behavioural science, and lately, to study central bank communication. Machine learning algorithms such as the LDA model can be used to study and uncover topic trends within central banks communication. Given the increase in the amount of speeches and releases and their always increasing importance as a monetary policy tool, the ability to monitor such communications is potentially useful for real-time analysis. This technique can be also used to monitor topic trends over time.

    Hansen et al. (2018) employ text analysis to study how greater external communication affects the internal deliberation of monetary policymakers in the Federal Open Market Committee (FOMC). Their database is made of 149 FOMC meeting transcripts from August 1987 to January 2006, under the chairmanship of Alan Greenspan. They construct measures of communication based on both basic text counts and on topic models and use them as a dependent variable in an econometric model. Their LDA model is estimated for all the statements in each meeting in the whole sample, while the econometrics analysis focuses on the statements made in each meeting during the stage of the economic situation discussion and the policy discussion. Tumala and Omotosho (2019) choose LDA to analyse the communication strategy of the Central Bank of Nigeria (CBN) during the period 2004-2019. They base their analysis on 87 policy communiques released after the meetings of the CBN's monetary policy committee and published by the Bank on its website. They estimate a LDA model to identify the key topics driving monetary policy communication during the sample period. Thus, they analyse the evolution of the topics by plotting the topic proportion for the identified topics. Their results show that the topics are influenced by the Bank's announced policy objectives as well as the nature of shocks hitting the economy per period. Keida and Takeda (2018) apply LDA to 70 regular press conference documents of the Bank of Japan's (BOJ) governors Masaaki Shirakawa and Haruhiko Kuroda from July 2012 to July 2018. The authors find that NLP techniques are useful for identifying changes in monetary authorities, such as when Shirakawa succeeded Kuroda. Carboni et al. (2020) investigate the content of the ECB and Fed Governors' speeches for the period 2007-2019 using LDA. Their database consists of 313 speeches of Governors Jean Claude Trichet and Mario Draghi of the ECB and 193 speeches of Governors Ben Bernanke, Janet Yellen and Jerome Powell of the Fed. In the speeches of both central banks they find that topics linked to traditional subjects like monetary policy and financial stability play the main role. They find small tracks of topics related to new challenges, for example environmental sustainability.

    To investigate the predictive power of our weighted central bank indicators we use a probit model, which is a standard methodology to assess the significance of central bank communication*, as shown by recent research. Apel and Grimaldi (2014) examine whether a rate policy decision can be explained by their hawk/dove sentiment indicator for the Swedish Riksbank, after controlling for the lagged dependent variable and macroeconomic variables, and discover that their measure helps to predict policy decisions. Cour-Thimann (2020) estimate the ECB reaction function augmented with their communication indicator, finding that sentiment estimated from minutes significantly improve the reaction function. Bennani et al. (2020) investigate the ECB communication as a signal of monetary policy change proxied by MRO rate or the shadow rate, by controlling for the zero-lower bound (ZLB) and financial market expectations. Their results suggest that communication about conventional measures helps explaining the future ECB rate changes, even controlling for the ZLB period.

    *See Gerlach-Kristen (2004)

    Our dataset includes speeches and releases available for download from the institutional websites of the European Central Bank, the Bank of England, the Bank of Canada and the Federal Reserve. The choice relies on the fact that these countries are the world's largest advanced economies according to the IMF; therefore, the global economy is largely influenced by the communication of their central banks, which it is worth being investigated. For the Fed, we also include the FOMC meeting transcripts available for download from the website of the Federal Reserve Bank of St. Louis. We only extract the speech transcription, avoiding page headers and footnotes. Speeches can vary in length, depending on the event at which the speech is delivered; releases are typically short and standardized, as they are the summary of monetary policy decisions or press releases with a focus on monetary policy. Overall, our dataset includes 6137 documents spanning the period from January 2000 to April 2021 and distributed as follow: 1007 for the BoC, 1319 for the BoE, 2306 for the ECB and 1505 for the Fed.

    The monthly and yearly distributions of our dataset are shown in Figure 1. It is worth mentioning that the number of documents published by the Banks steadily increased from the first half of the 2000s, later stabilizing in the 2010s. This finding is in line with the increase in central banks' transparency documented by Dincer and Eichengreen (2013). We notice a substantial increase in publications between 2006 and 2010 for the ECB, the BoC and the BoE, signalling that the financial crisis and the Great Recession had an impact on central banks' transparency. Instead, the largest increase for the Fed is observable during the last years of Greenspan's chairmanship (2000-2006).

    Figure 1.  Yearly and monthly distribution of speeches and releases for each central bank.

    We clean and process the documents by applying a set of text mining techniques, along the lines of the literature in the field of NLP (Silge and Robinson, 2017). Such transformations are needed to carry out the empirical analysis. In detail, the documents are converted in text-corpus following a methodology detailed by Grün and Hornik (2011). We clean the corpus by lowercasing the words and removing numbers, extra spaces, punctuation, regular expressions and standard stopwords. After this parsing phase, we tokenize* the text and extract unigrams and bigrams. Then, we create a list of bank-specific stopwords including Governors' names and non-English words, leaving out non-English common words with economic relevance (such as COVID-19 and Brexit). Finally, we process the tokens with the Porter stemming algorithm (Porter, 1980) to eliminate common suffixes and prefixes. As a result of this phase, we achieve a document-term matrix.

    *Tokenization is the process of breaking the text into blocks readable by the software.

    n-grams are series of n adjacent elements from a given string of tokens.

    Next, we prune the dataset using the Terms frequency-Inverse document frequency (Tf-Idf) function (Baeza-Yates and Ribeiro-Neto, 1999) calculated on each central bank's document term matrix. The Tf-Idf is a function that weighs the importance of a term within to the whole dataset (Leskovec et al., 2020). The function is expressed as:

    (tfidf)i,j=tfi,jidfi (1)

    Pruning is the process of reducing the dataset size to improving the learning of a machine learning model.

    with

    tfi,j=ni,j|dj|;idfi=log(|D||{d:id}|) (2)

    where tfi,j stands for Term frequency of word i in the document j; ni,j is the number of occurrences of word i in the document j, while |dj| is the length of document j as the sum of its terms. In addition, idfi stands for Inverse document frequency of word i in the collection of documents D and |D| is the number of the documents in the sample. The denominator measures the number of documents in which i is present. This function increases the weight of terms that occur very frequently in a given document but that are less frequent in the whole collection of documents (Grün and Hornik, 2011). As a result, common words are filtered out and the resulting word rank is more meaningful than the absolute frequency approach*. Grün and Hornik (2011) clean the dataset by omitting all terms with a Tf-Idf value below 0.01. We fine-tune their approach by choosing a flexible threshold specific to each document, rather than pruning all the dataset for a fixed value. In fact, for each document, we discard all the terms having a Tf-Idf value below the median of the word distribution in a given document. Our final dataset includes 4267 words for the BoC, 8972 words for the BoE, 8363 words for the ECB, 8075 words for the Fed.

    *The rank is a list of words in order of decreasing frequency, i.e. from the most to the less used word.

    In Figures A1–A4 in Appendix we provide a visual representation of the pruned dataset for each bank using word clouds, including the 100 most important words according the Tf-Idf. In the word clouds, the rank given by the Tf-Idf is represented by the size of the word itself. Many terms are common almost to all banks: notably, "monetary policy", "inflation", "risk" and "rate. Nevertheless, there are some things to notice: while "inflation" is among the first words extracted for the Fed and the BoC, this is not the case for the ECB and the BoE. Furthermore, for the Fed, both the words "economy" and "market" are more frequent than for the other banks. Finally, it is also interesting to note the high frequency of the term "treaty/ies" for the ECB, because policy makers need to refer quite often to the legal basis of the institution.

    Finally, we investigate each central bank's text distribution according to Zipf's Law (Zipf, 1936). Zipf's law states the inverse relationship between the frequency of a term and its rank, Figures A5–A8 in Appendix show the theoretical Zipf's curve compared to the empirical ones calculated on the corpus pre and post pruning, finding the Zipf's law holds when pruning the dataset.

    In this paper we use the Latent Dirichlet Allocation, introduced by Blei et al. (2003). LDA is a generative probabilistic model. The basic idea is that documents are represented as random mixtures over latent topics, and each topic is characterized by a distribution over words. The model takes a corpus as an input and gives as output the distribution of topics for each document in the corpus and the distribution of words for each topic. Figure 2 shows the plate notation for the model. Assuming that the corpus consists of M document, each of length Ni, LDA assumes the following generative process.

    Figure 2.  Plate notation for LDA.

    ● For each document, choose θiDirichlet(α) for i = 1, …, M, where α is the concentration parameter of the Dirichlet prior on the per-document topic distribution.

    ● For each topic, choose φkDirichlet(β) for k = 1, …, K, where β is the concentration parameter of the Dirichlet prior on the per-topic words distribution.

    ● For each word position (i,j), where i = 1, …, M and j = 1, …, Ni

    – Choose a topic zi,jMultinomial(θi)

    – Choose a word wi,jMultinomial(φzi,j)

    The words wi,j are the only observable variables, while θ, φ and z are latent variables. Going through the three steps above, the algorithm infers the posterior distribution of the latent variables:

    p(θ,φ,z|w,α,β)=p(θ,φ,z|α,β)p(w|α,β) (3)

    After the posterior estimates for φ and θ are obtained, the algorithm returns the topic representation of each document in the corpus.

    LDA is a complex algorithm which is generally hard to fine-tune and interpret. Besides the text cleaning and pre-processing, getting relevant results with LDA depends heavily on the model setup: α and β which must be provided as input to the algorithm, as well as the number of topics K. The hyperparameters α and β affect the sparsity of the Dirichlet distributions. With a higher α, documents are assumed to be composed of more topics and this results in a more specific per-document topics distribution. This means that when choosing a low α it is more likely that a document may contain just a mix of few topics. Likewise, with a higher β, topics are assumed to be composed of more words and this results in a more specific per-topic words distribution. With the last input parameter K, the LDA model clusters documents into different topics.

    In section three, we described the procedure we adopt to clean and pre-process the data. In this section, we explain how we select the hyperparameters and estimate the LDA for the banks in our sample. *

    *To run the model, we use Gensim library in Python.

    In literature there is unfortunately no guidance for the choice of the hyperparameters. Following the general advice of Griffiths and Steyvers (2004) and Navarro et al. (2006), Hansen et al. (2018) set α = 50K and β = 0.025, and estimate the model for K = 50 and K = 70. Tumala and Omotosho (2019) and Keida and Takeda (2018) heuristically assume a 6 topic model and a 3 topic model, respectively. Carboni et al. (2020) look at a 5-topic and 10-topic model and, based on human judgement, pick the 10-topic model. In this paper we mainly rely on topic coherence measures, which provide a convenient way to assess the quality of the topic extracted. They measure the degree of semantic similarity between the most frequent words in a topic and allow to separate topics that are interpretable from those that are purely statistical. The cv measure, introduced by Röder et al. (2015), is a confirmation measure that combines normalized pointwise mutual information (NPMI) with the cosine vector similarity. Given a word wkj, let vj be defined as:

    vji=NPMI(wi,wj)γ=(logp(wi,wj)+ϵp(wi),p(wj)log(p(wi,wj)+ϵ))γi=1,...,N (4)

    cv measure is the arithmetic mean of the top words cosine similarities:

    cos(u,v)=Ni=1uiviNi=1u2iNi=1v2i (5)

    As said, there is no benchmark for the choice of optimal LDA hyperparameters and therefore the selection is up to the researcher. Hence, one of our attempts it to outline a procedure that one could easily follow with less involvement of human judgement. Our approach for finding the optimal hyperparameters is to perform an exhaustive grid search. Going through a specified subset of the hyperparameters space, we train several models and generate candidates α, β and K. As a final step, we pick the combination of α, β and K that optimize the coherence scores. In detail, based on human judgement, we define a range in which the hyperparameters can vary. After investigating each central bank's institutional website, we assume our dataset to be made up of only a few labels and our words to belong to some of the topics. With this in mind, α, β are typically set below one. The maximum number of topics K is set to 70, given that a larger number of topics may hamper interpretability. Thus, for each K, we train the model for α[0.01,1] in steps of 0.1. For each K and α, we train the model for β[0.01,0.41] in steps of 0.1. We train each model on 90% of the documents and, for each of them, we calculate cv. We estimate the posterior distribution of the latent variables using the online variational Bayes inference algorithm developed by Logsdon et al. (2010)*.

    *We set the following values for the training parameters: iterations = 300 (maximum number of iterations through the corpus when inferring the topic distribution of the corpus), chunk size = 50% of the collection (number of documents to consider at once during the training), passes = 10 (how many times the algorithm is supposed to pass over the whole corpus during the training).

    After training all the models, for each K we pick the pair (α, β) that optimize the coherence scores. Our final choice is to use the pair (α, β) for which cv is optimized more frequently across all K. Thus, plotting the cv curve, we pick K around the optimal values of cv that best fit interpretability, based on our sensitivity. Finally, we examine the resulting topics and the associated top ten words. The probabilities associated to each word reflect how important the word is to that topic. We cluster the topics in ˉK bank-specific labels to obtain a list of topics that are interpretable and exhaustive.

    Once the LDA is trained, we test the model on unseen documents (10% of the corpus). The testing phase consists in reading the speeches and judge whether the label assigned by the model is coherent with the effective topic. For all the banks in the sample, the percentage of documents that are misclassified by the algorithm is 5% on average.

    For each central bank we build a CBSI for each label, with the methodology described in Priola et al. (2021). We choose the Loughran McDonald dictionary* motivated by the broad consensus in the literature on the validity of this dictionary when applied to finance text analysis (Shapiro et al., 2020; Armelius et al., 2020). Thus, we use the Loughran-McDonald dictionary to assign a value of one if the connotation of the words in a document is positive, minus one if it is negative. We do not consider neutral words. After having determined the frequency of words with the positive and negative connotation, we calculate a normalized score as follows:

    NetScoreIndext=#Positivet#Negativet#Positivet+#Negativet (6)

    *To better reach out the sentiment of the last year, we augmented the dictionary with the words "pandemic" and "coronavirus" among the negative ones; this negative connotation is only temporary and subjected to further research of the authors, with the aim of enriching the standard dictionary.

    where the numerator is the difference between the total number of positive and negative words, while the denominator is the total number of positive and negative words. This score (named Net Score from now on) has the advantage of being comparable even among texts with different length without creating distortions in the indicator. The range of the Net Score Index is between -1 and 1. High scores indicate a positive view of the economic context by the institution, whereas low levels denote a negative outlook.

    The Net Score can show extreme values, i.e. with very short speeches where one or two words are captured in the text. In this case, we opt for NaN values to avoid excessive noise.

    Using the subset of documents belonging to the label k we build the CBSIkt, k = 1, …, ˉK. Thus, we aggregate in monthly averages the indexes. As a final step, we calculate the weighted average:

    CBSIwt=ˉKk=1wk,tCBSIktˉKk=1wk,t (7)

    where wk,t is the ratio of the number of documents in which the topic k appears to the total number of documents at month t.

    In this section we provide first a description of the optimal LDA hyperparameters and the learned topics. Second, we discuss the results of our analysis for each central bank in the sample.

    As explained in Section 4, we choose the pairs (α, β) that most frequently optimize cv. Having fixed the hyperparameters, we calculate the coherence measure for each K and use it as a guidance for choosing K. We look at the models for which the resulting coherence function is relatively higher and opt for the K that delivers a list of exhaustive and interpretable topics. As a result of our experimental procedure, we find that the optimal values are those shown in Table 1. It is worth noting that for the ECB the optimal α and β are the lowest of the gridsearch (0.01 and 0.11 for α and β, respectively). This might be explained by the larger number of documents and the multitude of speakers that characterize the ECB dataset.

    Table 1.  Summary results for LDA training.
    Central Bank K* α* β*
    BOC 55 0.41 0.31
    BOE 35 0.61 0.31
    ECB 60 0.01 0.11
    FED 30 0.61 0.21

     | Show Table
    DownLoad: CSV

    For each bank, we cluster the topics resulting from the model training and assign a label to each cluster. Overall, we create eleven labels, described below.

    Climate Change (CLIMATE). Documents on climate change issues related to monetary policy and the economy.

    Consumer Protection and Community Development (CONSCOM). As stated by the Federal Reserve itself, documents about promoting Consumer Protection and Community Development advances supervision, community reinvestment, and research to increase understanding of the impact of financial services policies and practices on consumers and communities* with the aim of promoting financial literacy.

    * https://www.federalreserve.gov/aboutthefed.htm

    EMU Affairs (EMU_AFF). Documents on European Monetary Union whose contents examine themes such as the affiliation of new member State and/or the adoption of the euro in a country, as well as issues related to the European governance.

    Financial Integration (FINTEGR). Documents on developments of a well-integrated financial system.

    https://www.ecb.europa.eu/home/search/financial_integration/html/index.en.html

    Financial Stability and Macroprudential Policy (MACPRUD). Documents relating to supervising and regulating financial institutions and activities; corporate risk management.

    Fiscal Policy (FIS_POL). Documents on pensions, funds and fiscal issues.

    Innovation (INNOV). Documents on technology, digitalization. It may include also survey, reports on data and central banks' research.

    International Affairs (INT_AFF). Documents about international governance and the global economy, international trade.

    Monetary Policy (MON_POL). Documents relating to each central bank's specific monetary policy framework; economic outlook and reports.

    Other Topics (MISCELL). Documents on arts, culture and not interpretable topics.

    Payments and Settlements (PAYSECS). Documents about coins and banknotes releases, credit cards, payment settlements. More generally, everything involving the payments system and its infrastructure. It includes digital payments when INNOVAT is not clearly clustered.

    In Table 2, we show the number of topics clustered under each label for each bank. Ignoring the recipient of irrelevant or intractable results (MISCELL), we find as expected that MON_POL is one of the most frequent topics, followed by MACPRUD and PAYSECS. Those topics are shared by all institutions, being part of their mandate. For the Bank of Canada, we discover 7 labels. After Monetary Policy and Macroprudential Policy, International Affairs' weight is in line with the importance of the global trade strategy that Canada has been developing to diversify its economy and improve competitiveness. In fact, Canada heavily relies on trade as it represents more than 60 percent of its output* - one of the highest ratios among the G7 countries. To sum up, international trade is "the lifeblood of the Canadian economy". The Bank of England is the one with less topics and the labels MON_POL and MACPRUD comprehend almost all topics. This result is as expected, since the institutional webpage allow us to explore speeches by filtering only for few topic options. The European Central Bank is the target bank whose topics show the highest variety of subjects (9), possibly due to the largest dataset. Again, the MON_POL and MACPRUD labels include the highest number of topics. The Federal Reserve is characterized by 6 labels, where MON_POL and MACPRUD are almost equivalent in terms of number of topics and CONSCOM is specifically of the Fed, since promoting Consumer Protection and Community Development is one of central bank's tasks.

    Table 2.  Topics frequency distribution.
    Topics BoC BoE ECB Fed
    CLIMATE 1 - 1 -
    CONSCOM - - - 6
    EMU_AFF - - 6 -
    FINTEGR - - 1 -
    FIS_POL 1 - 2 -
    INNOVAT 2 - 3 1
    INT_AFF 6 1 3 1
    MACPRUD 7 7 10 8
    MISCELL 16 16 8 3
    MON_POL 18 10 22 9
    PAYSECS 4 1 4 2
    Num of Topics 55 35 60 30

     | Show Table
    DownLoad: CSV

    *Trade is the sum of exports and imports (of goods and services) as a share of gross domestic product. World Bank data, 2019.

    Remarks by Mr Timothy Lane, Deputy Governor of the Bank of Canada, at the Saskatoon Regional Economic Development Authority, Saskatoon, Saskatchewan, 18 September 2017.

    In Figures 3-6 we show the yearly topic distribution. For the BoC, MON_POL includes 668 speeches and releases, accounting for 65% of the documents. The topic MACPRUD, consisting of 182 documents, weighs on average 18%. The third topic ranked by relative frequency INT_AFF has 102 documents and on average weighs 10%. For the BoE, on average MON_POL represents 60% of the documents with 788 speeches or releases. The second topic MACPRUD consists of 403 documents and a weight of 30% overall. The third topic ranked by relative frequency PAYSECS has 70 documents. For the ECB the main label MON_POL includes 58% of the documents set with 1345 speeches or releases. The second topic MACPRUD consists of 414 documents and a weight of 18% overall. The third topic ranked by relative frequency EMU_AFF has 151 documents. FIS_POL differs from others central banks as its speeches are related to the opportunity of a fiscal union, as a view expressed by the central bank. FINTEGR consists of 140 documents (7% of total) with a focus on the opportunity – as well as the urgency – of a banking union. For the Fed, MON_POL accounts for half the dataset (686 documents), followed by MACPRUD that weighs 30% (446 documents). CONSCOM has 225 speeches in total.

    Figure 3.  BoC Yearly Topic Distribution.
    Figure 4.  BoE Yearly Topic Distribution.
    Figure 5.  ECB Yearly Topic Distribution.
    Figure 6.  Fed Yearly Topic Distribution.

    Again, in Figures 3-6 we show the relative yearly frequency and infer the topic trends by looking at the evolution of their frequency over time. Since the subprime crisis (2007-2009) the central banks have given more attention to macroprudential policy issues. In those years, BoC's focus shifted to macroprudential policy and international affairs. In the last six years, the central bank started talking about digital payments, financial technology and all innovation-related topics. Although the BoE does not share INNOVAT among the identified topics, it focuses on digital payments that are included in PAYSECS segment as shown by the trend of the last past five years. For the ECB, we also notice that PAYSECS and EMU_AFF were more relevant in 2000-2005 as a consequence of the Euro introduction and new member countries joining the EU. In the following years, central bankers started to talk about fiscal integration (FIS_POL). Lastly, CLIMAT appears in 2018, following the European Commission Action Plan on Sustainable finance (March 2018). For the Fed, we can see that during the subprime years the focus increases to contents about Consumers and Community, the ones heavily hit by the financial crisis. In fact, we find that financial literacy is a recurrent subject within the documents on this topic. This is because financial distress is not always caused by a lack of financial education, but in many cases such knowledge mitigates the consequences of unaware decisions*. In the last years, Fed's Governors touched upon themes such as digital payments and fintech.

    *Remarks by Alan Greenspan, Chairman of the Board of Governors of the US Federal Reserve System, before the National Council on Economic Education, Chicago, 26 October 2001.

    Next, we extract the 10 top words for each K and aggregate the word topic distributions by label. An exhaustive list of top words with assigned probabilities are in Appendix, Tables A5–A8. We provide descriptive statistics of CBSIw and its sub-indicators in Appendix, Tables A1–A4. In Figures 7-10 we show the CBSIw with both global and country-specific events, such as the Great Recession and the beginning of the COVID-19 crisis (dotted lines) and the European Debt Crisis (shaded area). During the years of the subprime crisis (2007-2009) all topic-sentiment indicators moved downwards. To understand better what happened, we plot the weighted CBSI together with the MON_POL indicator. We see that for all banks the indicator dropped in 2007, when the subprime crisis started. Afterwards, all the CBSIw improved during the period of economic recovery. Furthermore, the ECB indicator goes below zero during the European sovereign debt crisis, in the years from 2010 to 2015. We also find a remarkable drop for all central banks in April 2020 due to the pandemic emergency. In particular, ECB's sentiment reached its historical minimum in April 2020. These findings are in line with Priola et al. (2020), which build a Central Bank Sentiment Index on the whole dataset of speeches and releases for the BoE, the BoJ, the ECB and the Fed. Next, we look at the correlation between the weighted CBSI and the CBSI sub-indicators, which gives us a measure of which topic is driving the trend of the weighted CBSI. Considering the topics with a consistent number of speeches, we find that MACPRUD, MON_POL and PAYSECS are the leading topics for the BoC with a Pearson correlation coefficient at least above 0.8; the BoE's main driver is MON_POL with 0.84. For the Fed the highest correlation is shown by CONSCOM (0.71). MON_POL is also the driving cluster for the ECB (0.85). Those statistics are shown in Appendix.

    Figure 7.  BoC CBSIw and policy rates.
    Figure 8.  BoE CBSIw and policy rates.
    Figure 9.  ECB CBSIw and policy rates.
    Figure 10.  Fed CBSIw and policy rates.

    Finally, we investigate the relation between the central bank sentiment, proxied by our weighted CBSI and the relative policy rate. Since CBSIw shows an erratic behavior due to its textual nature, we clean the indicator by extracting the signal with a Kalman filter algorithm (Kalman, 1960), labelled as CBSIw_kf. In Figures 7-10, we find that the CBSIw_kf shows evidence of positive correlation for all the policy rates, with a strong correlation for the BoC (0.86), for the ECB (0.76) and a moderate correlation for the BoE (0.52) and the Fed (0.61). We find a misalignment between the two variables for a window of time after the 2008 global financial crisis, where central banks adopted non-standard monetary policies and reached the so-called effective lower bound (ELB). In details, the BoC determined that the ELB on its key policy rate was 0.25 percent in 2009*(Witmer and Yang, 2015). The BoE cut the Bank Rate to 0.50 percent in March 2009 and started undertaking a large-scale asset purchases, implying that the current rate was viewed as the effective lower bound for the policy rate (Swanson and Williams, 2014); the Fed kept the federal funds target rate essentially at zero from December 2008 to December 2015, while the ELB has been recently identified between August 2011 and December 2012 (Benamar et al., 2021). For the ECB the break starts in 2014, when the ECB lowered its deposit facility rate below zero while keeping the Main Refinancing Operations (MRO) unchanged. Under the ELB environment the Deposit Facility Rate signalled the monetary policy stance more closely (Cour-Thimann, 2020). This evidence suggests that during the period of quantitative easing key policy rates become less informative, in line with the literature that proposes new indicators to quantify the effects of non-standard policies on the economy (Wu and Xia, 2016).

    *In December 2015, the BoC released a new Monetary Policy Framework, estimating that its new ELB could be around -0.50 percent. See the speech given by Poloz (2015) for more details.

    In this section, we assess the quality of information extracted by the LDA algorithm and examine the predictive power of our monetary policy sentiment indicator. We use the following ordered probit model

    Δrt+1=α1Δrt+α2MON_POLt+ϵt (8)

    Since policy rates typically move by a discrete amount, e.g. by 25 points (Rosa, 2009), we choose the ordered probit method because it takes account of the discrete nature of variables (Hayo and Neuenkirch, 2010). We control for Δrt+1 that is the policy rate decision at time t+1, a categorical variable taking values (-1, 0, 1), correspondent to a rate cut, an unchanged rate, a rate hike. Δrt is the policy rate decision at time t. MON_POLt is the sub-index related to the monetary policy label, used as a proxy of central bank communication in the baseline specification. Next, we augment the model according to the Taylor rule:

    Δrt+1=α1Δrt+α2MON_POLt+α3ΔGDPt+α4ΔINFt+ϵt (9)

    where ΔGDPt is the GDP growth year-on-year. We interpolate the quarterly GDP via dynamic Chow-Lin developed by Silva and Cardoso (2001) using as leading indicator the industrial production index. ΔINFt is the CPI changes year-on-year. Data is at monthly frequency, covering January 2000 to March 2021. We replace in (2) the sub-index with the CBSIwt as a robustness check, given that the MON_POLt sub-index is its major driver, as explained in Section 5. For every estimation we report the McFadden's likelihood ratio index (McFadden, 1974) as a goodness of fit for the ordered probit models, given by:

    PseudoR=1LLmodLL0 (10)

    where LLmod is the log-likelihood value for the fitted model and LL0 is the log-likelihood value for the baseline model that includes only the intercept as regressor. McFadden's likelihood ratio index ranges from 0 to 1.

    Tables 3 shows the results of the estimation. Column Ⅰ is the baseline model, a regression of the future change in the policy rate on the change in the policy rate in the previous period. Column Ⅱ is the model with our MON_POL indicator. The results in column Ⅱ suggest that in model (1) the monetary policy sentiment index helps predicting the next policy rate decision for all the Banks, with the exception of the Bank of Canada. It worth noting that we do not find a significant result for the BoC specification (Ⅱ) as it can be affected by collinearity, given the high correlation between the series. Hence, we test the probit specification without the lagged dependent variable (last policy rate decision) finding that the CBSIw is significant for all the banks. In column Ⅲ, the results obtained controlling for the macroeconomics variables based on the Taylor rule, show that the monetary policy indicator remains informative. The result holds for the Federal Reserve, the European Central Bank and the Bank of England. Next, we run the same regression specification replacing the MON_POL index with the weighted CBSI. Results are shown in columns Ⅴ and Ⅵ. Our weighted central bank sentiment index is still informative, also after accounting for macroeconomic variables. For the Fed, results are consistent with Hayo and Neuenkirch (2010), who analyse the importance of Federal Reserve communication to predict federal funds target rate decisions for the sample period 1998-2006. For the ECB, results are also in line with Picault and Renault (2017) that show the importance of the ECB communication index for future monetary policy and stock market dynamics.

    Table 3.  Ordered probit estimation results.
    variables (Ⅰ) (Ⅱ) (Ⅲ) (Ⅳ) (Ⅴ) (Ⅵ)
    BANK OF CANADA
    Δrt 1.15***, (0.1864) 1.12***, (0.1887) 1.06***, (0.1931) 1.08***, (0.191) 1.1***, (0.1895) 1.05***, (0.1932)
    CBSIwt 0.15*, (0.0857) 0.11, (0.0882)
    MON_POLt 0.09, (0.085) 0.05, (0.0873)
    ΔGDPt 0.28***, (0.1061) 0.29***, (0.1047) 0.27**, (0.1071)
    ΔINFt -0.18*, (0.1028) -0.19*, (0.1014) -0.18*, (0.102)
    -1/0 -1.41***, (0.1187) -1.41***, (0.1188) -1.45***, (0.1227) -1.45***, (0.1228) -1.42***, (0.1191) -1.45***, (0.1228)
    0/1 1.43***, (0.1196) 1.44***, (0.1208) 1.47***, (0.1243) 1.47***, (0.1237) 1.45***, (0.1224) 1.48***, (0.1254)
    pseudo R^2 0.1219 0.1254 0.1462 0.145 0.131 0.1497
    LLR -146.49 -145.89 -142.43 -142.63 -144.96 -141.85
    BANK OF ENGLAND
    Δrt 1.19***, (0.2325) 1.04***, (0.2415) 1.05***, (0.2437) 1.15***, (0.2368) 1.07***, (0.2388) 1.07***, (0.2407)
    CBSIwt 0.26***, (0.0979) 0.28**, (0.1089)
    MON_POLt 0.37***, (0.1014) 0.41***, (0.1124)
    ΔGDPt -0.03, (0.1008) 0.1, (0.0923) 0.01, (0.0983)
    ΔINFt 0.09, (0.1034) -0.02, (0.0957) 0.06, (0.1031)
    -1/0 -1.55***, (0.1288) -1.63***, (0.1369) -1.64***, (0.1382) -1.56***, (0.1292) -1.59***, (0.1322) -1.59***, (0.133)
    0/1 1.7***, (0.1384) 1.83***, (0.1558) 1.84***, (0.158) 1.71***, (0.1393) 1.77***, (0.1481) 1.78***, (0.1494)
    pseudo R^2 0.106 0.1632 0.1663 0.1104 0.1359 0.1376
    LLR -112.1 -104.93 -104.54 -111.55 -108.35 -108.14
    EUROPEAN CENTRAL BANK
    Δrt 0.78***, (0.2195) 0.52**, (0.2339) 0.5**, (0.2353) 0.64***, (0.2277) 0.63***, (0.2278) 0.59**, (0.2303)
    CBSIwt 0.29***, (0.0956) 0.22*, (0.1168)
    MON_POLt 0.44***, (0.103) 0.42***, (0.1249)
    ΔGDPt 0.07, (0.11) 0.25***, (0.0929) 0.16, (0.1062)
    ΔINFt -0.05, (0.101) 0.03, (0.0955) -0.02, (0.1003)
    -1/0 -1.44***, (0.1198) -1.54***, (0.1291) -1.54***, (0.1293) -1.48***, (0.1235) -1.49***, (0.1236) -1.5***, (0.1247)
    0/1 1.55***, (0.1254) 1.71***, (0.146) 1.71***, (0.1452) 1.61***, (0.1314) 1.63***, (0.1352) 1.63***, (0.1353)
    pseudo R^2 0.0475 0.1212 0.1233 0.0788 0.0841 0.0917
    LLR -129.19 -119.19 -118.9 -124.95 -124.23 -123.19
    FEDERAL RESERVE
    Δrt 1.09***, (0.1856) 1.11***, (0.1873) 1***, (0.193) 0.97***, (0.1913) 1.08***, (0.1868) 0.99***, (0.1922)
    CBSIwt 0.2**, (0.0845) 0.15*, (0.0899)
    MON_POLt 0.19**, (0.0838) 0.16*, (0.0858)
    ΔGDPt 0.25**, (0.0985) 0.28***, (0.0969) 0.25**, (0.0997)
    ΔINFt -0.05, (0.0928) -0.05, (0.0925) -0.08, (0.0942)
    -1/0 -1.45***, (0.1223) -1.47***, (0.1237) -1.52***, (0.1301) -1.51***, (0.1295) -1.47***, (0.1232) -1.52***, (0.13)
    0/1 1.36***, (0.1146) 1.4***, (0.1188) 1.43***, (0.1223) 1.41***, (0.1194) 1.4***, (0.1196) 1.43***, (0.1223)
    pseudo R^2 0.11 0.1263 0.1467 0.1367 0.1268 0.1456
    LLR -148.27 -145.55 -142.16 -143.82 -145.46 -142.34
    The model is estimated by maximum likelihood. Robust standard errors are in parenthesis. Coefficients labelled with ***, **, * denote significance at 1%, 5%, 10% levels, respectively. The number of observations is 255.

     | Show Table
    DownLoad: CSV

    Notably, probit coefficients do not offer a direct measure of the contribution of explanatory variables to the dependent one, as it happens in linear models. Thus, we calculate the average marginal effects (hereafter AME) that Gelman and Hill (2006) called "average predicted probability" to assess the magnitude of the effects. We calculate the AME for model (Ⅰ) and (Ⅱ) to interpret the results shown in Table 4. For a better interpretation we standardize all continuous variables, therefore results reports the average marginal effects on the probability (of a rate cut, unchanged rate, rate hike) for a one standard deviation increase in the explanatory continuous variable, or for a unit change in discrete explanatory variables. Given the specification in model (Ⅰ), for the BoC the (not significant) probability of a policy rate hike goes up by 18% after a hike implemented in the last policy rate decision, where the probability of a rate cut increase by 18% after a unit decrease in Δrt. Given a positive (negative) unit change in last rate decision, a rate hike (cut) is more likely with 12% (16%) of probability for the BoE, 10% (11%) for the ECB, 18% (16%) for the Fed.

    Table 4.  Average marginal effects.
    BoC P(RC) P(NC) P(RH) BoE P(RC) P(NC) P(RH)
    (Ⅰ) (Ⅰ)
    Δrt -0.184*** 0.005 0.180*** Δrt -0.160*** 0.038 0.121***
    (Ⅱ) (Ⅱ)
    Δrt -0.179*** 0.005 0.173*** Δrt -0.134*** 0.038 0.096***
    MON_POLt -0.015 0.000 0.014 MON_POLt -0.048*** 0.014 0.034***
    (Ⅲ) (Ⅲ)
    Δrt -0.164*** 0.003 0.161*** Δrt -0.135*** 0.039 0.097***
    MON_POLt -0.008 0.000 0.008 MON_POLt -0.052*** 0.015 0.037***
    ΔGDPt -0.043*** 0.001 0.042** ΔGDPt 0.003 -0.001 -0.002
    ΔINFt 0.027* -0.001 -0.027* ΔINFt -0.011 0.003 0.008
    ECB P(RC) P(NC) P(RH) Fed P(RC) P(NC) P(RH)
    (Ⅰ) (Ⅰ)
    Δrt -0.116*** 0.019 0.098*** Δrt -0.161*** -0.027 0.187***
    (Ⅱ) (Ⅱ)
    Δrt -0.073** 0.016 0.058** Δrt -0.162*** -0.022 0.183***
    MON_POLt -0.062*** 0.013 0.049*** MON_POLt -0.028** -0.004 0.032**
    (Ⅲ) (Ⅲ)
    Δrt -0.071** 0.015 0.056** Δrt -0.138*** -0.025 0.163***
    MON_POLt -0.059*** 0.012 0.047*** MON_POLt -0.022* -0.004 0.025*
    ΔGDPt -0.009 0.002 0.008 ΔGDPt -0.035*** -0.006 0.041**
    ΔINFt 0.008 -0.002 -0.006 ΔINFt 0.007 0.001 -0.008
    Coefficients are labelled with ***, **, * and denote significance at 1%, 5%, 10% levels, respectively. Probabilities of RC = Rate Cut; NC = No Change, RH = Rate Hike.

     | Show Table
    DownLoad: CSV

    In the MON_POL specification (Ⅱ), we find that the a one-standard deviation increase (decrease) change in monetary policy communication increases on average the probability of a rate hike (rate cut) by 1% for the BoC, 5% (3%) for the BoE, 5% (6%) for the ECB, 3% for the Fed. When adding our communication variable, we find that the marginal contribution of most recent policy rate decision decreases at least for the ECB and the BoE, as partial information is captured by the central banks' monetary policy indicator. Finally, results hold for the specification (Ⅲ), where controlling for macroeconomic variables results show significant average marginal effects.

    As a check, we run the same probit technique with an alternative specification by excluding the period under the Effective Lower Bound (ELB) environment for the BoE, the ECB and the Fed. We cut our samples respectively by March, 2009, by June, 2014 and from August 2011 to December 2012 Benamar et al. (2021). Tables A3-A4 in Appendix show the results of the alternative specification for the probit regression and the AME results, respectively. Results hold when excluding the ELB period, in the model specifications employed. However, we notice that when employing CBSIw as a sentiment indicator, the Taylor-like specification (Ⅵ) for the ECB loses significance. Our interpretation is that since the ECB dataset is the most heterogeneous in terms of topics amount, the MON_POL index is more likely to capture central bank communication focused on interest rates changes. As expected, we find an increase in the average marginal effects for one-standard deviation increase (decrease) of MON_POL. Given the model (Ⅱ), for the BoE the AME doubled from 5% to 10% (rate cut) and from 3% to 7% (rate hike); for the ECB, from 6% to 8% (rate cut) and from 5% to 7% (rate hike), while for the Fed is unchanged (3%) respective of a rate cut and a rate hike. This result might suggest that the adoption of unconventional policies breaks the relationship between key policy rates and the forward guidance given in speeches and releases.

    Central banks communication is strongly associated with a number of critical decisions, therefore the ability to rapidly monitor such information is potentially useful for policymakers and market participants. The literature acknowledges text analysis as a useful technique, as it decodes information embedded in documents into quantitative indicators. However, text analysis does not discriminate among the contexts in which the same words can be used.

    In this paper, we develop a combination of machine learning and text analysis techniques to build a topic-weighted Central Bank Sentiment Index for four central banks (BoC, BoE, ECB, Fed). Using a large dataset that includes speeches and press releases, we uncover the latent topics running the LDA algorithm. The main advantage of LDA is that, once the model is trained, it can predict topics for new unseen documents and it provides an intuitive tool for real time analysis. However, the model training requires careful fine-tuning to obtain good results. For this reason, we use a gridsearch approach for the optimal choice of the hyperparameters.

    After labelling the dataset according to the identified topics for each bank, we build a topic sentiment index for each bank, according to the methodology proposed in Priola et al. (2021). As a last step, we aggregate the sentiment indexes in a topic weighted Central Bank Sentiment Index. We show that our CBSIw for the four Central Banks share common trends, decreasing during the Great Recession and during the COVID-19 crisis; moreover, we find a strong positive correlation between the indicator and the policy rates. Amongst the sub-indicators that compose the CBSIw, we find that the major driver is monetary policy, followed by macroprudential policy and financial stability. Moreover, all banks draw the attention to common subjects such as digitalization and fintech, but there are also bank-specific topics. For the BoC, International Affairs is the third leading subject that supports the evidence of the increasing importance of global competitiveness after the subprime crisis in 2007. For the BoE, digitalization and fintech innovation for payments is important, as shown by the increasing numbers of related speeches in the last years of the sample. For the first ten years in the sample, the ECB focused on issues related to the development of the European Monetary Union, proxied by European Affairs, and stressed the importance of financial integration. We also find an increasing frequency of climate change and digitalization contents, one of the new ECB commitments stated by Christine Lagarde during the Opening Statement to the Economic and Monetary Affairs Committee, in September 2019. For the Fed, among other topics, we find that the central bank promotes the protection of consumers by publishing speeches about the importance of financial education and community development.

    Next, we investigate the predictiveness power of the sentiment indicators by employing a probit model. We test the monetary policy sub-index contents as well as and the weighted index. We find that coefficients are positive and significant, suggesting that the MON_POL index helps to predict future changes in policy rates, proxied by a ternary variable taking values of -1, 0, 1 (respectively a rate cut, unchanged rate, rate hike). The results hold when adding macroeconomic variables with a Taylor rule-like framework. According to the average marginal effects, we find that a rate hike is more likely to happen with a one standard deviation change in monetary policy communication, as well as a rate cut with a downward change of the same index. We add a preliminary investigation on the effective lower bound, with results that suggest that the MON_POL index increases its explanatory power when excluding the ELB period.

    We believe that further research may be relevant. First, the importance of unconventional monetary policies might have caused structural changes in the relationship between central banks' communication and monetary policy decisions. Therefore, an effort to identify this potential break is required with a further investigation on new monetary policy stance indicators that better proxy forward guidance during the period of unconventional monetary policy. Second, with the growing importance of social media for political and financial communication, there is an emerging field of analysis that links central banking news and the sentiment reaction of market participants that might be worth exploring.

    All authors declare no conflicts of interest in this paper.

    Figure 11.  BoC word cloud.
    A larger font indicates that a term is used with a higher frequency in its respective text corpus.
    Figure 12.  BoE word cloud.
    A larger font indicates that a term is used with a higher frequency in its respective text corpus.
    Figure 13.  ECB word cloud.
    A larger font indicates that a term is used with a higher frequency in its respective text corpus.
    Figure 14.  Fed word cloud.
    A larger font indicates that a term is used with a higher frequency in its respective text corpus.
    Figure 15.  BoC Zipf curve.
    Figure 16.  BoE Zipf curve.
    Figure 17.  ECB Zipf curve.
    Figure 18.  Fed Zipf curve.
    Table 5.  BoC descriptive statistics of CBSI weighted and its sub-indicators.
    BOC avg med std obs cor max min
    CBSIw -0.18 -0.19 0.19 1007 1.00 0.47 -0.61
    CLIMATE 0.20 0.20 0.35 2 -1.00 0.56 -0.15
    FIS_POL 0.29 0.29 0.10 6 1.00 0.39 0.19
    INNOVAT -0.04 -0.07 0.19 28 0.44 0.43 -0.36
    INT_AFF -0.22 -0.23 0.23 102 0.69 0.30 -0.60
    MACPRUD -0.23 -0.27 0.26 182 0.83 0.64 -0.72
    MISCELL 0.19 0.08 0.16 3 0.35 0.41 0.07
    MON_POL -0.19 -0.20 0.20 668 0.82 0.33 -0.80
    PAYSECS 0.00 -0.04 0.28 16 0.87 0.37 -0.47

     | Show Table
    DownLoad: CSV
    Table 6.  BoE descriptive statistics of CBSI weighted and its sub-indicators.
    BOE avg med std obs cor max min
    CBSIw -0.30 -0.31 0.11 1319 1.00 0.04 -0.62
    MACPRUD -0.33 -0.34 0.15 403 0.69 0.22 -0.62
    INT_AFF -0.28 -0.29 0.18 50 0.53 0.15 -0.59
    MON_POL -0.32 -0.32 0.12 788 0.84 0.08 -0.62
    MISCELL -0.12 -0.14 0.20 8 0.52 0.20 -0.35
    PAYSECS -0.07 -0.05 0.19 70 0.75 0.20 -0.51

     | Show Table
    DownLoad: CSV
    Table 7.  ECB descriptive statistics of CBSI weighted and its sub-indicators.
    ECB avg med std obs cor max min
    CBSIw -0.01 -0.03 0.17 2306 0.98 0.40 -0.47
    CLIMATE -0.24 -0.30 0.14 9 -0.20 0.02 -0.39
    EMU_AFF 0.17 0.18 0.19 151 0.30 0.67 -0.42
    FINTEGR 0.31 0.33 0.19 140 0.19 0.77 -0.25
    MACPRUD -0.10 -0.11 0.20 414 0.42 0.54 -0.59
    FIS_POL 0.06 0.02 0.27 4 0.93 0.43 -0.25
    INNOVAT 0.17 0.15 0.20 43 0.56 0.71 -0.29
    INT_AFF 0.02 0.03 0.22 80 0.67 0.46 -0.43
    MON_POL -0.06 -0.07 0.18 1345 0.85 0.39 -0.47
    MISCELL 0.16 0.15 0.30 5 0.89 0.56 -0.21
    PAYSECS 0.26 0.22 0.24 115 0.36 0.88 -0.37

     | Show Table
    DownLoad: CSV
    Table 8.  Fed descriptive statistics of CBSI weighted and its sub-indicators.
    FED avg med std obs cor max min
    CBSIw - 0.22 -0.23 0.12 1505 1.00 0.12 -0.47
    CONSCOM - 0.12 -0.13 0.24 225 0.71 0.56 -0.76
    INNOVAT - 0.05 -0.09 0.20 45 0.31 0.42 -0.50
    INT_AFF - 0.15 -0.18 0.16 67 0.38 0.21 -0.57
    MACPRUD - 0.24 -0.26 0.17 446 0.67 0.26 -0.65
    MISCELL - 0.06 -0.06 0.00 1 n.a. -0.06 -0.06
    MON_POL - 0.24 -0.26 0.16 686 0.69 0.34 -0.65
    PAYSECS - 0.06 -0.07 0.18 35 0.33 0.38 -0.49
    MON_POL -0.06 -0.07 0.18 1345 0.85 0.39 -0.47
    MISCELL 0.16 0.15 0.30 5 0.89 0.56 -0.21
    PAYSECS 0.26 0.22 0.24 115 0.36 0.88 -0.37

     | Show Table
    DownLoad: CSV
    Table 9.  Word-topic distribution for the Bank of Canada.
    TOPIC WORDS
    MACPRUD ('0.043*"mortgag" + 0.025*"market" + 0.020*"hous" + 0.009*"guidelin" + 0.008*"rate" + 0.008*"choic" + 0.008*"adjust" + 0.007*"flexibl" + 0.007*"chang" + 0.007*"share"')
    MACPRUD ('0.042*"bank" + 0.034*"financi" + 0.024*"risk" + 0.019*"global" + 0.017*"will" + 0.016*"crisi" + 0.015*"market" + 0.015*"capit" + 0.014*"reform" + 0.012*"requir"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"patch" + 0.000*"envis"')
    MON_POL ('0.031*"price" + 0.031*"econom" + 0.015*"import" + 0.015*"commod" + 0.015*"global" + 0.013*"trade" + 0.012*"export" + 0.012*"increas" + 0.011*"countri" + 0.011*"commod_price"')
    FIS_POL ('0.020*"risk" + 0.019*"plan" + 0.016*"pension" + 0.015*"infrastructur" + 0.014*"invest" + 0.012*"individu" + 0.012*"effici" + 0.011*"defin_benefit" + 0.011*"pension_plan" + 0.011*"sponsor"')
    MON_POL ('0.149*"bank" + 0.047*"rate" + 0.032*"project" + 0.032*"econom" + 0.017*"target_overnight" + 0.014*"inflat" + 0.013*"outlook" + 0.011*"announc_overnight" + 0.010*"target" + 0.010*"report_updat"')
    MON_POL ('0.100*"polici" + 0.090*"fiscal" + 0.044*"econom" + 0.033*"monetari" + 0.029*"monetari_polici" + 0.027*"inflat" + 0.024*"bank" + 0.020*"rate" + 0.019*"govern" + 0.017*"effect"')
    INT_AFF ('0.015*"inequ" + 0.010*"brazil" + 0.006*"million" + 0.004*"incom" + 0.003*"unemploy" + 0.003*"coeffici" + 0.002*"equal" + 0.001*"gain" + 0.001*"cycl" + 0.001*"educ"')
    PAYSECS ('0.020*"payment" + 0.006*"futur" + 0.006*"oversight" + 0.006*"need" + 0.005*"will" + 0.005*"vision" + 0.004*"import" + 0.004*"payment_system" + 0.004*"growth" + 0.004*"user"')
    MACPRUD ('0.067*"market" + 0.054*"liquid" + 0.032*"bank" + 0.023*"financi" + 0.020*"asset" + 0.018*"will" + 0.017*"credit" + 0.015*"fund" + 0.013*"term" + 0.013*"central_bank"')
    MON_POL ('0.036*"potenti_output" + 0.030*"growth" + 0.021*"econom" + 0.021*"product" + 0.015*"will" + 0.013*"output" + 0.011*"inflat" + 0.011*"trend_labour" + 0.009*"input" + 0.008*"trend"')
    MISCELL ('0.000*"inflat" + 0.000*"bank" + 0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"expect" + 0.000*"terror" + 0.000*"price" + 0.000*"econom" + 0.000*"underpin"')
    MON_POL ('0.026*"inflat" + 0.020*"rate" + 0.020*"wage" + 0.012*"lower" + 0.012*"labour_market" + 0.012*"polici" + 0.010*"growth" + 0.010*"expect" + 0.009*"econom" + 0.009*"compani"')
    INT_AFF ('0.016*"open" + 0.014*"tariff" + 0.008*"foreign" + 0.008*"confeder" + 0.007*"year" + 0.007*"peopl" + 0.007*"immigr" + 0.006*"world" + 0.006*"trade" + 0.006*"invest"')
    INT_AFF ('0.026*"foreign_exchang" + 0.020*"survey" + 0.017*"transact" + 0.016*"tabl" + 0.014*"turnov" + 0.013*"rate" + 0.012*"dollar" + 0.012*"currenc" + 0.010*"bank" + 0.010*"increas"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"bank" + 0.000*"underpin" + 0.000*"econom" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath"')
    MON_POL ('0.011*"lower" + 0.010*"resourc" + 0.005*"price" + 0.005*"shock" + 0.005*"seen" + 0.005*"profil" + 0.005*"import" + 0.005*"variabl" + 0.005*"higher" + 0.004*"pose"')
    MON_POL ('0.003*"bond_yield" + 0.001*"attempt" + 0.001*"logic" + 0.001*"elect" + 0.001*"deflat" + 0.001*"long_stand" + 0.001*"underperform" + 0.001*"expos_intern" + 0.001*"peril" + 0.001*"transit_period"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"patch" + 0.000*"envis"')
    MISCELL ('0.107*"governor" + 0.095*"said" + 0.020*"bank" + 0.017*"econom" + 0.009*"professor" + 0.008*"intern" + 0.007*"univers" + 0.006*"ad" + 0.006*"governor_told" + 0.006*"school"')
    MON_POL ('0.001*"york" + 0.001*"rebuild_phase" + 0.001*"took_place" + 0.001*"emphas" + 0.001*"comparison" + 0.001*"remain_stimul" + 0.001*"absolut_essenti" + 0.001*"junctur" + 0.001*"headwind" + 0.001*"truli"')
    INT_AFF ('0.043*"growth" + 0.043*"export" + 0.039*"product" + 0.030*"invest" + 0.026*"trade" + 0.021*"busi" + 0.018*"econom" + 0.017*"global" + 0.016*"recess" + 0.016*"firm"')
    MON_POL ('0.075*"econom" + 0.045*"growth" + 0.036*"expect" + 0.034*"will" + 0.026*"bank" + 0.024*"inflat" + 0.020*"global" + 0.019*"rate" + 0.017*"project" + 0.014*"outlook"')
    MON_POL ('0.059*"econom" + 0.040*"polici" + 0.030*"countri" + 0.025*"global" + 0.020*"market" + 0.017*"need" + 0.015*"adjust" + 0.015*"financi" + 0.014*"intern" + 0.014*"import"')
    INNOVAT ('0.026*"bank" + 0.017*"will" + 0.016*"money" + 0.016*"peopl" + 0.014*"need" + 0.013*"work" + 0.011*"issu" + 0.011*"busi" + 0.010*"innov" + 0.010*"payment"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"econom" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"bank"')
    CLIMATE ('0.026*"climat_chang" + 0.019*"risk" + 0.016*"climat" + 0.015*"bank" + 0.011*"financi" + 0.010*"climat_relat" + 0.008*"design" + 0.007*"scenario" + 0.006*"clear_settlement" + 0.006*"transit"')
    PAYSECS ('0.019*"financi" + 0.009*"cyber_secur" + 0.008*"threat" + 0.008*"cyber" + 0.007*"payment" + 0.005*"coordin" + 0.005*"protect" + 0.005*"particip" + 0.005*"collabor" + 0.005*"cyber_attack"')
    MON_POL ('0.034*"econom" + 0.023*"will" + 0.022*"worker" + 0.018*"labour_market" + 0.016*"technolog" + 0.015*"work" + 0.015*"peopl" + 0.012*"skill" + 0.012*"sector" + 0.011*"product"')
    MISCELL ('0.000*"intensifi" + 0.000*"precari" + 0.000*"governor_told" + 0.000*"deputi" + 0.000*"senior_deputi" + 0.000*"immedi" + 0.000*"downsid" + 0.000*"audienc" + 0.000*"host" + 0.000*"choic"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"patch" + 0.000*"envis"')
    MACPRUD ('0.057*"market" + 0.039*"financi" + 0.023*"bank" + 0.018*"effici" + 0.012*"risk" + 0.011*"rate" + 0.011*"need" + 0.011*"regul" + 0.011*"will" + 0.009*"investor"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"patch" + 0.000*"envis"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"patch" + 0.000*"envis"')
    PAYSECS ('0.093*"note" + 0.071*"bank" + 0.022*"counterfeit" + 0.012*"confid" + 0.011*"payment" + 0.009*"imag" + 0.009*"woman" + 0.008*"seri" + 0.008*"cash" + 0.007*"will"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"patch" + 0.000*"envis"')
    MACPRUD ('0.030*"vulner" + 0.030*"hous" + 0.030*"market" + 0.012*"continu" + 0.011*"price" + 0.010*"hous_price" + 0.009*"household_indebted" + 0.008*"risk" + 0.007*"rise" + 0.006*"report"')
    INNOVAT ('0.034*"senior_deputi" + 0.012*"deputi_governor" + 0.010*"read_content" + 0.009*"speech_appear" + 0.009*"type_press" + 0.009*"read" + 0.009*"summari_topic" + 0.009*"watch" + 0.008*"summari" + 0.007*"watch_deputi"')
    MON_POL ('0.044*"inflat" + 0.038*"econom" + 0.029*"monetari_polici" + 0.027*"bank" + 0.024*"rate" + 0.023*"inflat_target" + 0.021*"target" + 0.019*"polici" + 0.016*"price" + 0.012*"will"')
    MON_POL ('0.024*"spend" + 0.023*"household" + 0.017*"incom" + 0.015*"recoveri" + 0.014*"save" + 0.013*"will" + 0.011*"servic" + 0.011*"pandem" + 0.010*"consumpt" + 0.010*"econom"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"patch" + 0.000*"envis"')
    MON_POL ('0.029*"polici" + 0.026*"central_bank" + 0.019*"econom" + 0.019*"rate" + 0.016*"growth" + 0.011*"forward_guidanc" + 0.010*"effect" + 0.008*"financi" + 0.008*"will" + 0.008*"crisi"')
    MISCELL ('0.000*"room_improv" + 0.000*"transmit" + 0.000*"monetari_union" + 0.000*"meant" + 0.000*"extraordinari" + 0.000*"essenti_element" + 0.000*"said" + 0.000*"self_reinforc" + 0.000*"withdrawn" + 0.000*"diminish"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"patch" + 0.000*"envis"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"endors" + 0.000*"aftermath" + 0.000*"patch" + 0.000*"envis"')
    PAYSECS ('0.038*"payment" + 0.023*"payment_system" + 0.016*"system" + 0.012*"risk" + 0.008*"modern" + 0.008*"valu" + 0.007*"settl" + 0.004*"retail_payment" + 0.003*"liquid" + 0.002*"bank"')
    MON_POL ('0.033*"program" + 0.032*"econom" + 0.027*"pandem" + 0.024*"recoveri" + 0.022*"support" + 0.019*"will" + 0.018*"bank" + 0.016*"busi" + 0.015*"covid" + 0.011*"polici"')
    INT_AFF ('0.013*"price" + 0.013*"retail" + 0.006*"unit_state" + 0.005*"market" + 0.004*"exchang_rate" + 0.004*"cost" + 0.004*"good" + 0.004*"differenti" + 0.004*"factor" + 0.004*"retail_sector"')
    MISCELL ('0.000*"tragic_event" + 0.000*"duti" + 0.000*"bank" + 0.000*"risk" + 0.000*"financi" + 0.000*"preserv" + 0.000*"terror" + 0.000*"underpin" + 0.000*"attitud" + 0.000*"econom"')
    MON_POL ('0.006*"client" + 0.005*"trust" + 0.003*"employe" + 0.002*"reward" + 0.002*"banker" + 0.002*"diminish" + 0.002*"relianc" + 0.002*"widespread" + 0.002*"connect" + 0.002*"restor_trust"')
    INT_AFF ('0.034*"econom" + 0.027*"global" + 0.020*"will" + 0.019*"financi" + 0.018*"countri" + 0.013*"market" + 0.012*"polici" + 0.011*"need" + 0.010*"growth" + 0.010*"world"')
    MACPRUD ('0.051*"hous" + 0.021*"rate" + 0.020*"mortgag" + 0.019*"debt" + 0.019*"hous_price" + 0.017*"polici" + 0.015*"chart" + 0.012*"household" + 0.010*"market" + 0.010*"borrow"')
    MACPRUD ('0.060*"financi" + 0.032*"risk" + 0.025*"econom" + 0.025*"bank" + 0.018*"crisi" + 0.017*"financi_stabil" + 0.016*"vulner" + 0.013*"will" + 0.013*"monetari_polici" + 0.012*"polici"')
    MON_POL ('0.059*"econom" + 0.028*"adjust" + 0.020*"will" + 0.017*"demand" + 0.017*"monetari_polici" + 0.016*"rate" + 0.014*"bank" + 0.014*"chang" + 0.013*"help" + 0.013*"product"')
    MON_POL ('0.076*"inflat" + 0.032*"price" + 0.031*"measur" + 0.029*"expect" + 0.021*"bank" + 0.014*"rate" + 0.013*"econom" + 0.013*"consum" + 0.011*"percept" + 0.010*"household"')]

     | Show Table
    DownLoad: CSV
    Table 10.  Word-topic distribution for the Bank of England.
    TOPIC WORDS
    MON_POL ('0.020*"econom" + 0.018*"work" + 0.014*"employ" + 0.013*"rate" + 0.013*"unemploy" + 0.010*"survey" + 0.010*"worker" + 0.010*"data" + 0.009*"self_employ" + 0.008*"firm"')
    MACPRUD ('0.079*"bank" + 0.022*"rate" + 0.019*"market" + 0.016*"asset" + 0.014*"credit" + 0.014*"lend" + 0.014*"financi" + 0.013*"fund" + 0.010*"liquid" + 0.009*"econom"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MON_POL ('0.058*"product" + 0.038*"growth" + 0.016*"capit" + 0.016*"econom" + 0.016*"invest" + 0.012*"sector" + 0.010*"measur" + 0.010*"compani" + 0.009*"servic" + 0.009*"labour"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MON_POL ('0.042*"econom" + 0.027*"inflat" + 0.021*"growth" + 0.017*"price" + 0.016*"will" + 0.013*"rate" + 0.010*"demand" + 0.010*"global" + 0.010*"monetari_polici" + 0.009*"busi"')
    INT_AFF ('0.022*"financi" + 0.019*"econom" + 0.019*"bank" + 0.017*"global" + 0.016*"countri" + 0.013*"crisi" + 0.012*"chart" + 0.010*"intern" + 0.010*"asset" + 0.008*"sourc"')
    MON_POL ('0.037*"growth" + 0.027*"inflat" + 0.024*"bank" + 0.019*"expect" + 0.013*"market" + 0.013*"recent" + 0.013*"polici" + 0.012*"financi" + 0.012*"increas" + 0.011*"monetari_polici"')
    MON_POL ('0.031*"growth" + 0.028*"rate" + 0.024*"inflat" + 0.020*"price" + 0.018*"increas" + 0.016*"expect" + 0.013*"fallen" + 0.011*"committe" + 0.011*"survey" + 0.010*"remain"')
    MISCELL ('0.000*"union" + 0.000*"acut_awar" + 0.000*"threshold" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier"')
    MACPRUD ('0.026*"equiti" + 0.026*"compani" + 0.024*"sharehold" + 0.016*"valu" + 0.015*"share" + 0.013*"short" + 0.011*"investor" + 0.010*"corpor" + 0.010*"dividend" + 0.007*"return"')
    MON_POL ('0.034*"expect" + 0.029*"inflat" + 0.011*"survey" + 0.009*"percept" + 0.007*"ration" + 0.006*"bank" + 0.006*"form" + 0.006*"respons" + 0.005*"column" + 0.005*"educ"')
    MACPRUD ('0.078*"market" + 0.021*"risk" + 0.021*"financi" + 0.013*"liquid" + 0.012*"trade" + 0.011*"manag" + 0.010*"will" + 0.008*"asset" + 0.008*"secur" + 0.007*"bank"')
    MON_POL ('0.040*"rate" + 0.019*"inflat" + 0.017*"econom" + 0.013*"expect" + 0.012*"growth" + 0.011*"monetari_polici" + 0.011*"will" + 0.011*"polici" + 0.010*"real" + 0.010*"household"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MACPRUD ('0.002*"ship" + 0.002*"centuri" + 0.001*"marin" + 0.001*"disput" + 0.001*"affair" + 0.001*"gambl" + 0.001*"enemi" + 0.001*"despit" + 0.001*"council" + 0.001*"life_assur"')
    MACPRUD ('0.040*"bank" + 0.035*"risk" + 0.027*"financi" + 0.018*"regul" + 0.017*"will" + 0.013*"capit" + 0.010*"crisi" + 0.010*"financi_stabil" + 0.009*"need" + 0.009*"market"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    PAYSECS ('0.038*"bank" + 0.028*"payment" + 0.019*"will" + 0.012*"servic" + 0.011*"technolog" + 0.010*"cash" + 0.010*"chang" + 0.009*"note" + 0.009*"money" + 0.009*"oper"')
    MACPRUD ('0.036*"firm" + 0.029*"bank" + 0.023*"will" + 0.018*"resolut" + 0.016*"insur" + 0.013*"group" + 0.013*"supervis" + 0.011*"need" + 0.011*"solvenc" + 0.011*"regim"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MON_POL ('0.026*"unemploy" + 0.024*"inflat" + 0.019*"wage" + 0.019*"rate" + 0.015*"chang" + 0.015*"price" + 0.010*"employ" + 0.010*"effect" + 0.010*"econom" + 0.010*"labour"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MISCELL ('0.021*"gender" + 0.016*"ethnic_minor" + 0.014*"ethnic" + 0.013*"woman" + 0.009*"surveil" + 0.009*"femal" + 0.006*"group" + 0.005*"qualif" + 0.005*"sampl" + 0.005*"white"')
    MISCELL ('0.000*"self_employ" + 0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"econom" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"rate" + 0.000*"seduct" + 0.000*"uncertainti"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MON_POL ('0.031*"polici" + 0.028*"monetari_polici" + 0.027*"inflat" + 0.019*"price" + 0.018*"central_bank" + 0.015*"econom" + 0.015*"rate" + 0.014*"monetari" + 0.013*"inflat_target" + 0.011*"target"')
    MON_POL ('0.024*"bank" + 0.015*"public" + 0.014*"econom" + 0.009*"central_bank" + 0.009*"peopl" + 0.009*"time" + 0.009*"committe" + 0.008*"work" + 0.007*"polici" + 0.007*"institut"')
    MACPRUD ('0.023*"fair_valu" + 0.008*"valuat" + 0.006*"account" + 0.005*"mark" + 0.001*"foul" + 0.001*"misalign" + 0.001*"thrift" + 0.001*"hidden_reserv" + 0.001*"sight_deposit" + 0.001*"yardstick"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')
    MISCELL ('0.000*"acut_awar" + 0.000*"violat" + 0.000*"curios" + 0.000*"view_express" + 0.000*"pledg" + 0.000*"seduct" + 0.000*"comprehens_packag" + 0.000*"tier_tier" + 0.000*"bucket" + 0.000*"longstand"')]

     | Show Table
    DownLoad: CSV
    Table 11.  Word-topic distribution for the European Central Bank.
    TOPIC WORDS
    MACPRUD ('0.068*"risk" + 0.051*"bank" + 0.020*"financi" + 0.019*"manag" + 0.019*"regul" + 0.012*"sector" + 0.012*"resili" + 0.011*"capit" + 0.010*"requir" + 0.010*"stress_test"')
    MACPRUD ('0.008*"dilemma" + 0.008*"liquid" + 0.005*"incent" + 0.004*"dollar" + 0.004*"core" + 0.003*"longer" + 0.003*"gold" + 0.003*"shortag" + 0.003*"privat" + 0.002*"suppli"')
    MON_POL ('0.011*"spread" + 0.009*"autonom_factor" + 0.007*"forecast" + 0.006*"eurosystem" + 0.005*"short_term" + 0.004*"reserv" + 0.004*"liquid" + 0.003*"rate" + 0.003*"requir" + 0.003*"benchmark"')
    MON_POL ('0.046*"exchang_rate" + 0.022*"polici" + 0.012*"countri" + 0.012*"market" + 0.011*"euro_area" + 0.010*"rate" + 0.009*"effect" + 0.009*"econom" + 0.008*"eurogroup" + 0.008*"regim"')
    CLIMATE ('0.040*"climat_chang" + 0.027*"climat" + 0.024*"green" + 0.019*"risk" + 0.015*"climat_relat" + 0.012*"econom" + 0.012*"will" + 0.011*"environment" + 0.011*"transit" + 0.010*"sustain"')
    MISCELL ('0.002*"build" + 0.002*"permit" + 0.001*"renov" + 0.001*"preserv" + 0.001*"landscap" + 0.001*"energi" + 0.001*"annex" + 0.001*"construct" + 0.001*"collabor" + 0.001*"past_year"')
    EMU_AFF ('0.027*"design" + 0.013*"european" + 0.012*"europ" + 0.011*"citi" + 0.011*"competit" + 0.007*"invoic" + 0.007*"phase" + 0.006*"build" + 0.006*"euro" + 0.006*"opportun"')
    PAYSECS ('0.132*"euro" + 0.033*"banknot" + 0.025*"banknot_coin" + 0.022*"euro_area" + 0.021*"public" + 0.016*"cash_changeov" + 0.015*"introduct" + 0.014*"nation" + 0.012*"cash" + 0.011*"currenc"')
    PAYSECS ('0.106*"payment" + 0.059*"sepa" + 0.025*"scheme" + 0.024*"retail_payment" + 0.019*"innov" + 0.019*"servic" + 0.019*"bank" + 0.017*"custom" + 0.017*"payment_servic" + 0.015*"card"')
    MON_POL ('0.031*"econom" + 0.020*"euro_area" + 0.017*"growth" + 0.017*"price" + 0.017*"rate" + 0.014*"develop" + 0.014*"price_stabil" + 0.012*"inflat" + 0.012*"monetari_polici" + 0.010*"polici"')
    MON_POL ('0.006*"regul" + 0.006*"safe" + 0.005*"short_term" + 0.004*"swap_spread" + 0.004*"bond" + 0.004*"german" + 0.003*"swap" + 0.003*"flight_safeti" + 0.003*"widen" + 0.003*"grow"')
    PAYSECS ('0.060*"market" + 0.021*"secur" + 0.021*"infrastructur" + 0.019*"eurosystem" + 0.011*"oper" + 0.011*"servic" + 0.011*"financi" + 0.011*"particip" + 0.010*"euro" + 0.010*"collater"')
    MON_POL ('0.055*"monetari_polici" + 0.023*"polici" + 0.020*"communic" + 0.019*"decis" + 0.018*"strategi" + 0.015*"bank" + 0.015*"independ" + 0.014*"econom" + 0.013*"market" + 0.013*"public"')
    MISCELL ('0.010*"work" + 0.007*"time" + 0.006*"look" + 0.005*"construct" + 0.004*"build" + 0.004*"challeng" + 0.004*"peopl" + 0.004*"like" + 0.004*"event" + 0.004*"basic"')
    MON_POL ('0.082*"financi" + 0.037*"stabil" + 0.028*"asset" + 0.022*"price" + 0.021*"polici" + 0.021*"crisi" + 0.021*"risk" + 0.020*"monetari_polici" + 0.013*"econom" + 0.011*"price_stabil"')
    MISCELL ('0.003*"legitimaci" + 0.002*"evolut" + 0.001*"summit" + 0.001*"zero" + 0.001*"brief" + 0.001*"disput" + 0.001*"gather" + 0.001*"steer" + 0.001*"secretariat" + 0.001*"onset_crisi"')
    MON_POL ('0.034*"econom" + 0.034*"household" + 0.023*"countri" + 0.023*"incom" + 0.022*"euro_area" + 0.015*"china" + 0.010*"inequ" + 0.010*"european" + 0.009*"unit_state" + 0.009*"increas"')
    FIS_POL ('0.007*"fiscal" + 0.004*"debt_ratio" + 0.003*"ship" + 0.003*"region" + 0.003*"qualiti" + 0.003*"statut" + 0.002*"polici" + 0.002*"debt" + 0.002*"rule" + 0.002*"refer_valu"')
    INT_AFF ('0.004*"bank" + 0.002*"rate" + 0.002*"cash" + 0.002*"base" + 0.002*"commerci" + 0.001*"deposit" + 0.001*"provid" + 0.001*"account" + 0.001*"remuner" + 0.001*"differ"')
    INT_AFF ('0.090*"euro" + 0.072*"currenc" + 0.041*"countri" + 0.038*"exchang_rate" + 0.037*"intern" + 0.025*"euro_area" + 0.022*"region" + 0.018*"dollar" + 0.017*"role" + 0.013*"trade"')
    INNOVAT ('0.062*"bank" + 0.031*"digit" + 0.025*"cash" + 0.024*"base" + 0.020*"money" + 0.019*"deposit" + 0.019*"technolog" + 0.017*"rate" + 0.014*"commerci" + 0.013*"negat"')
    MON_POL ('0.000*"polici" + 0.000*"euro_area" + 0.000*"purchas" + 0.000*"price" + 0.000*"rate" + 0.000*"secur" + 0.000*"inflat" + 0.000*"monetari_polici" + 0.000*"market" + 0.000*"econom"')
    MON_POL ('0.038*"monetari" + 0.026*"money" + 0.026*"price" + 0.026*"inflat" + 0.025*"price_stabil" + 0.024*"econom" + 0.022*"growth" + 0.021*"analysi" + 0.021*"monetari_polici" + 0.017*"develop"')
    FINTEGR ('0.051*"integr" + 0.037*"financi" + 0.035*"market" + 0.033*"bank" + 0.029*"european" + 0.014*"cross_border" + 0.012*"nation" + 0.011*"europ" + 0.010*"singl" + 0.010*"servic"')
    MON_POL ('0.067*"rate" + 0.021*"polici" + 0.020*"monetari_polici" + 0.016*"inflat" + 0.016*"effect" + 0.015*"expect" + 0.014*"euro_area" + 0.012*"econom" + 0.012*"asset_purchas" + 0.010*"negat"')
    MACPRUD ('0.000*"rate" + 0.000*"bank" + 0.000*"market" + 0.000*"financi" + 0.000*"excess_liquid" + 0.000*"euro_area" + 0.000*"liquid" + 0.000*"increas" + 0.000*"level" + 0.000*"german"')
    MACPRUD ('0.057*"financi" + 0.053*"market" + 0.031*"bank" + 0.024*"liquid" + 0.022*"risk" + 0.012*"credit" + 0.010*"oper" + 0.010*"fund" + 0.009*"increas" + 0.009*"sector"')
    MON_POL ('0.031*"bank" + 0.006*"chang" + 0.006*"monetari_polici" + 0.006*"smaller" + 0.005*"billion" + 0.005*"small" + 0.004*"time" + 0.004*"comprehens_assess" + 0.004*"deleverag" + 0.004*"come"')
    MACPRUD ('0.064*"market" + 0.021*"financi" + 0.018*"risk" + 0.015*"govern_bond" + 0.014*"euro_area" + 0.013*"govern" + 0.011*"rate" + 0.011*"bond" + 0.011*"price" + 0.010*"spread"')
    EMU_AFF ('0.040*"trust" + 0.017*"faith" + 0.013*"legal_tender" + 0.010*"cash" + 0.008*"money" + 0.008*"status" + 0.007*"protect" + 0.006*"limit" + 0.006*"promis" + 0.004*"peopl"')
    EMU_AFF ('0.036*"europ" + 0.031*"cultur" + 0.018*"european" + 0.009*"spirit" + 0.009*"money" + 0.008*"peopl" + 0.008*"great" + 0.007*"uniti" + 0.007*"influenc" + 0.007*"work"')
    MON_POL ('0.000*"rate" + 0.000*"credit" + 0.000*"financi" + 0.000*"market" + 0.000*"euro_area" + 0.000*"measur" + 0.000*"risk" + 0.000*"support" + 0.000*"monetari_polici" + 0.000*"expect"')
    MON_POL ('0.045*"growth" + 0.041*"euro_area" + 0.037*"econom" + 0.029*"product" + 0.025*"market" + 0.020*"countri" + 0.020*"increas" + 0.014*"competit" + 0.013*"labour" + 0.011*"reform"')
    EMU_AFF ('0.006*"tariff" + 0.005*"report" + 0.003*"announc" + 0.003*"examin" + 0.003*"impos" + 0.002*"scenario" + 0.002*"annualis" + 0.001*"steel" + 0.001*"simul" + 0.001*"seven"')
    MACPRUD ('0.069*"bank" + 0.037*"rate" + 0.032*"deposit" + 0.032*"cash" + 0.027*"commerci" + 0.026*"base" + 0.023*"technolog" + 0.018*"account" + 0.018*"negat" + 0.017*"provid"')
    MON_POL ('0.029*"rate" + 0.025*"bank" + 0.024*"monetari_polici" + 0.017*"econom" + 0.015*"euro_area" + 0.014*"polici" + 0.013*"crisi" + 0.013*"market" + 0.012*"financi" + 0.012*"credit"')
    MON_POL ('0.027*"econom" + 0.024*"crisi" + 0.022*"bank" + 0.022*"euro_area" + 0.018*"countri" + 0.017*"financi" + 0.013*"fiscal" + 0.013*"govern" + 0.013*"union" + 0.012*"need"')
    MON_POL ('0.002*"trust" + 0.001*"owner_occupi" + 0.001*"daili_transact" + 0.001*"monetari_polici" + 0.001*"coin" + 0.001*"absolut" + 0.001*"thing" + 0.001*"loos" + 0.001*"imposit" + 0.001*"see_sign"')
    INT_AFF ('0.065*"global" + 0.042*"econom" + 0.039*"intern" + 0.039*"financi" + 0.027*"globalis" + 0.023*"world" + 0.018*"trade" + 0.018*"emerg" + 0.017*"countri" + 0.017*"domest"')
    MON_POL ('0.010*"exchang_rate" + 0.004*"capit_flow" + 0.003*"portfolio" + 0.003*"differenti" + 0.003*"correl" + 0.003*"half" + 0.003*"spillov" + 0.002*"relationship" + 0.002*"inform" + 0.002*"composit"')
    MISCELL ('0.009*"museum" + 0.009*"europ" + 0.008*"child" + 0.007*"languag" + 0.006*"learn" + 0.005*"featur" + 0.005*"berlin" + 0.005*"educ" + 0.004*"million_citizen" + 0.004*"connect"')
    MISCELL ('0.001*"tenth" + 0.001*"extrem_difficult" + 0.001*"trip" + 0.001*"peter_member" + 0.000*"forg" + 0.000*"scienc" + 0.000*"video" + 0.000*"librari" + 0.000*"teach" + 0.000*"school"')
    MISCELL ('0.005*"interim_solut" + 0.003*"fellowship" + 0.001*"safer" + 0.001*"slowest" + 0.001*"build_bridg" + 0.001*"follow_suit" + 0.001*"guilder" + 0.001*"compens" + 0.001*"feder_republ" + 0.001*"river"')
    MON_POL ('0.044*"euro_area" + 0.039*"growth" + 0.036*"econom" + 0.029*"remain" + 0.029*"continu" + 0.028*"rate" + 0.020*"inflat" + 0.017*"support" + 0.016*"expect" + 0.016*"increas"')
    MON_POL ('0.071*"inflat" + 0.041*"econom" + 0.029*"model" + 0.025*"price" + 0.021*"monetari_polici" + 0.017*"polici" + 0.017*"shock" + 0.012*"inflat_expect" + 0.011*"output" + 0.010*"chang"')
    PAYSECS ('0.034*"payment" + 0.017*"bank" + 0.011*"system" + 0.011*"money" + 0.008*"particip" + 0.007*"oversight" + 0.007*"respons" + 0.007*"larg" + 0.006*"currenc" + 0.005*"public"')
    MISCELL ('0.001*"fatigu" + 0.001*"diverg" + 0.001*"transposit" + 0.001*"verdict" + 0.001*"dilig" + 0.001*"lawmak" + 0.001*"pendulum" + 0.001*"swing" + 0.001*"supervis_forbear" + 0.001*"near"')
    MACPRUD ('0.003*"secur" + 0.003*"portfolio" + 0.002*"represent" + 0.001*"held" + 0.001*"night" + 0.001*"silent" + 0.001*"copi" + 0.001*"gone" + 0.001*"strateg_vision" + 0.001*"seventh"')
    MACPRUD ('0.050*"financi" + 0.022*"risk" + 0.019*"bank" + 0.017*"supervis" + 0.016*"stabil" + 0.016*"institut" + 0.016*"crisi" + 0.015*"framework" + 0.014*"polici" + 0.013*"regul"')
    MON_POL ('0.015*"power" + 0.006*"firm" + 0.005*"markup" + 0.005*"concentr" + 0.003*"geograph_breakdown" + 0.003*"rise" + 0.002*"margin" + 0.002*"constant" + 0.001*"enter" + 0.001*"macroeconomist"')
    MISCELL ('0.023*"educ" + 0.010*"student" + 0.009*"woman" + 0.006*"profit" + 0.006*"profit_margin" + 0.006*"school" + 0.005*"graduat" + 0.005*"gender" + 0.005*"child" + 0.005*"mark"')
    MACPRUD ('0.037*"secur" + 0.030*"bank" + 0.011*"activ" + 0.010*"increas" + 0.009*"risk" + 0.007*"financi" + 0.007*"institut" + 0.007*"firm" + 0.007*"result" + 0.006*"exercis"')
    EMU_AFF ('0.032*"european" + 0.030*"union" + 0.027*"econom" + 0.025*"countri" + 0.019*"polici" + 0.019*"euro_area" + 0.019*"integr" + 0.018*"euro" + 0.018*"monetari" + 0.017*"europ"')
    MON_POL ('0.030*"econom" + 0.023*"support" + 0.022*"measur" + 0.020*"inflat" + 0.015*"euro_area" + 0.014*"polici" + 0.014*"recoveri" + 0.013*"monetari_polici" + 0.013*"pandem" + 0.012*"condit"')
    FIS_POL ('0.023*"deficit" + 0.022*"fiscal" + 0.015*"current_account" + 0.012*"rule" + 0.010*"pact" + 0.010*"spend" + 0.010*"countri" + 0.009*"save" + 0.008*"polici" + 0.007*"public"')
    EMU_AFF ('0.025*"eurosystem" + 0.005*"transpar" + 0.005*"vote" + 0.004*"mission" + 0.004*"oblig" + 0.004*"process" + 0.004*"decis" + 0.004*"relat" + 0.003*"nation" + 0.003*"european_parliament"')
    MACPRUD ('0.097*"bank" + 0.089*"supervis" + 0.020*"regul" + 0.020*"nation" + 0.016*"european" + 0.013*"institut" + 0.011*"risk" + 0.009*"rule" + 0.008*"financi" + 0.008*"level"')
    INNOVAT ('0.032*"statist" + 0.024*"financi" + 0.023*"euro_area" + 0.022*"european" + 0.022*"econom" + 0.017*"account" + 0.016*"nation" + 0.014*"data" + 0.012*"develop" + 0.011*"monetari"')
    INNOVAT ('0.071*"data" + 0.025*"inform" + 0.018*"financi" + 0.015*"analysi" + 0.011*"need" + 0.009*"statist" + 0.009*"sector" + 0.008*"collect" + 0.008*"report" + 0.007*"aggreg"')
    MON_POL ('0.025*"clear" + 0.014*"recoveri" + 0.011*"member" + 0.008*"resolut" + 0.007*"alloc" + 0.006*"mandatori_clear" + 0.005*"resourc" + 0.005*"effect" + 0.004*"recoveri_resolut" + 0.003*"loss"')]

     | Show Table
    DownLoad: CSV
    Table 12.  Word-topic distribution for the Federal Reserve.
    TOPIC WORDS
    MACPRUD ('0.114*"bank" + 0.025*"communiti" + 0.017*"supervis" + 0.016*"regul" + 0.013*"institut" + 0.012*"financi" + 0.012*"feder_reserv" + 0.010*"activ" + 0.009*"examin" + 0.008*"requir"')
    MON_POL ('0.035*"market" + 0.032*"financi" + 0.029*"feder_reserv" + 0.027*"bank" + 0.023*"fund" + 0.019*"liquid" + 0.018*"rate" + 0.018*"credit" + 0.015*"secur" + 0.013*"reserv"')
    MISCELL ('0.000*"econom" + 0.000*"will" + 0.000*"financi" + 0.000*"market" + 0.000*"rate" + 0.000*"feder_reserv" + 0.000*"bank" + 0.000*"increas" + 0.000*"risk" + 0.000*"time"')
    MON_POL ('0.046*"committe" + 0.034*"econom" + 0.032*"rate" + 0.031*"will" + 0.025*"inflat" + 0.022*"feder" + 0.019*"fund" + 0.018*"polici" + 0.017*"monetari_polici" + 0.017*"feder_reserv"')
    MACPRUD ('0.054*"market" + 0.037*"financi" + 0.018*"econom" + 0.014*"inform" + 0.012*"product" + 0.011*"investor" + 0.011*"will" + 0.011*"risk" + 0.007*"valu" + 0.007*"market_particip"')
    MACPRUD ('0.061*"bank" + 0.035*"capit" + 0.035*"risk" + 0.022*"supervis" + 0.020*"will" + 0.016*"requir" + 0.014*"regul" + 0.012*"institut" + 0.010*"propos" + 0.010*"approach"')
    MACPRUD ('0.041*"risk" + 0.030*"risk_manag" + 0.029*"manag" + 0.022*"organ" + 0.015*"complianc" + 0.015*"process" + 0.014*"financi" + 0.013*"control" + 0.013*"effect" + 0.012*"busi"')
    MON_POL ('0.021*"will" + 0.021*"econom" + 0.019*"fiscal" + 0.017*"rate" + 0.014*"increas" + 0.013*"deficit" + 0.011*"save" + 0.011*"govern" + 0.011*"budget" + 0.009*"polici"')
    MON_POL ('0.005*"against_background" + 0.005*"dissent" + 0.004*"heighten" + 0.004*"nonetheless" + 0.003*"avail" + 0.003*"pressur" + 0.003*"expertis" + 0.003*"background" + 0.002*"decid_maintain" + 0.002*"unusu_high"')
    PAYSECS ('0.108*"consum" + 0.026*"credit_card" + 0.020*"disclosur" + 0.020*"credit" + 0.019*"product" + 0.018*"inform" + 0.015*"effect" + 0.015*"rate" + 0.011*"chang" + 0.010*"test"')
    MON_POL ('0.038*"inflat" + 0.032*"econom" + 0.027*"rate" + 0.014*"polici" + 0.012*"monetari_polici" + 0.012*"growth" + 0.011*"will" + 0.011*"price" + 0.010*"expect" + 0.010*"time"')
    INNOVAT ('0.041*"product" + 0.032*"econom" + 0.026*"invest" + 0.023*"growth" + 0.019*"capit" + 0.018*"busi" + 0.015*"product_growth" + 0.015*"increas" + 0.014*"technolog" + 0.011*"will"')
    CONSCOM ('0.053*"bank" + 0.018*"communiti" + 0.015*"servic" + 0.012*"feder_reserv" + 0.012*"data" + 0.011*"small_busi" + 0.010*"financi" + 0.009*"provid" + 0.009*"know" + 0.009*"work"')
    MACPRUD ('0.026*"firm" + 0.024*"regul" + 0.021*"financi" + 0.020*"bank" + 0.017*"fund" + 0.017*"requir" + 0.016*"liquid" + 0.015*"asset" + 0.013*"market" + 0.012*"capit"')
    MACPRUD ('0.039*"firm" + 0.032*"bank" + 0.031*"supervis" + 0.027*"regul" + 0.024*"capit" + 0.020*"stress_test" + 0.018*"financi" + 0.017*"requir" + 0.014*"will" + 0.012*"intern"')
    CONSCOM ('0.028*"mortgag" + 0.024*"hous" + 0.024*"credit" + 0.024*"market" + 0.023*"loan" + 0.017*"econom" + 0.012*"home" + 0.012*"will" + 0.012*"rate" + 0.011*"borrow"')
    CONSCOM ('0.039*"econom" + 0.032*"educ" + 0.022*"work" + 0.020*"worker" + 0.015*"employ" + 0.014*"famili" + 0.011*"will" + 0.010*"skill" + 0.010*"student" + 0.009*"particip"')
    CONSCOM ('0.047*"communiti" + 0.025*"develop" + 0.023*"econom" + 0.018*"feder_reserv" + 0.015*"neighborhood" + 0.014*"work" + 0.012*"area" + 0.012*"help" + 0.011*"need" + 0.009*"will"')
    MON_POL ('0.076*"price" + 0.028*"increas" + 0.020*"energi" + 0.018*"will" + 0.017*"market" + 0.016*"rise" + 0.016*"product" + 0.015*"cost" + 0.013*"demand" + 0.011*"econom"')
    MACPRUD ('0.056*"market" + 0.045*"risk" + 0.020*"credit" + 0.019*"financi" + 0.013*"liquid" + 0.012*"investor" + 0.011*"trade" + 0.009*"manag" + 0.009*"counterparti" + 0.009*"risk_manag"')
    MACPRUD ('0.074*"financi" + 0.021*"risk" + 0.021*"bank" + 0.021*"crisi" + 0.018*"institut" + 0.018*"regul" + 0.015*"stabil" + 0.014*"supervis" + 0.014*"market" + 0.012*"firm"')
    INT_AFF ('0.057*"econom" + 0.037*"global" + 0.029*"countri" + 0.021*"intern" + 0.017*"unit_state" + 0.017*"financi" + 0.016*"trade" + 0.015*"growth" + 0.015*"world" + 0.012*"foreign"')
    CONSCOM ('0.039*"mortgag" + 0.032*"loan" + 0.021*"borrow" + 0.019*"consum" + 0.017*"lender" + 0.016*"market" + 0.015*"feder_reserv" + 0.013*"regul" + 0.012*"data" + 0.012*"practic"')
    CONSCOM ('0.031*"household" + 0.024*"debt" + 0.024*"financi" + 0.023*"rate" + 0.015*"increas" + 0.014*"incom" + 0.013*"busi" + 0.011*"asset" + 0.011*"econom" + 0.010*"corpor"')
    MON_POL ('0.093*"clear" + 0.068*"member" + 0.052*"central" + 0.042*"ccps" + 0.025*"deriv" + 0.020*"clearinghous" + 0.016*"default" + 0.012*"network" + 0.012*"financi" + 0.010*"exchang"')
    MISCELL ('0.000*"will" + 0.000*"econom" + 0.000*"market" + 0.000*"financi" + 0.000*"rate" + 0.000*"bank" + 0.000*"feder_reserv" + 0.000*"monetari_polici" + 0.000*"risk" + 0.000*"growth"')
    MISCELL ('0.000*"market" + 0.000*"will" + 0.000*"econom" + 0.000*"financi" + 0.000*"growth" + 0.000*"bank" + 0.000*"risk" + 0.000*"increas" + 0.000*"rate" + 0.000*"credit"')
    MON_POL ('0.044*"rate" + 0.024*"price" + 0.023*"polici" + 0.019*"monetari_polici" + 0.018*"asset" + 0.015*"econom" + 0.014*"effect" + 0.013*"market" + 0.012*"financi" + 0.011*"expect"')
    PAYSECS ('0.060*"payment" + 0.022*"bank" + 0.015*"technolog" + 0.015*"will" + 0.014*"servic" + 0.013*"feder_reserv" + 0.013*"provid" + 0.011*"system" + 0.011*"money" + 0.011*"innov"')
    MON_POL ('0.034*"polici" + 0.032*"monetari_polici" + 0.032*"central_bank" + 0.026*"econom" + 0.022*"feder_reserv" + 0.018*"inflat" + 0.013*"object" + 0.011*"public" + 0.010*"rate" + 0.010*"review"')]

     | Show Table
    DownLoad: CSV
    Table 13.  Ordered probit estimation results excluding the ELB.
    variables (Ⅰ) (Ⅱ) (Ⅲ) (Ⅳ) (Ⅴ) (Ⅵ)
    BANK OF ENGLAND
    Δrt 0.96***, (0.2392) 0.72***, (0.2548) 0.62**, (0.2628) 0.8***, (0.2502) 0.78***, (0.2511) 0.67***, (0.259)
    CBSIwt 0.44***, (0.1336) 0.5***, (0.1668)
    MON_POLt 0.51***, (0.1381) 0.57***, (0.164)
    ΔGDPt 0.37*, (0.1995) 0.43**, (0.1934) 0.37*, (0.1968)
    ΔINFt 0.32*, (0.1729) 0.09, (0.1523) 0.32*, (0.176)
    -1/0 -1.11***, (0.1583) -1.22***, (0.173) -1.2***, (0.1749) -1.1***, (0.1614) -1.19***, (0.1697) -1.18***, (0.1724)
    0/1 1.33***, (0.1708) 1.49***, (0.1904) 1.54***, (0.1963) 1.39***, (0.1738) 1.45***, (0.1852) 1.51***, (0.1911)
    pseudo R^2 0.1 0.1857 0.2141 0.1394 0.1654 0.1942
    LLR -77.79 -70.38 -67.93 -74.38 -72.13 -69.65
    EUROPEAN CENTRAL BANK
    Δrt 0.71***, (0.2186) 0.38, (0.2387) 0.26, (0.247) 0.34, (0.2417) 0.51**, (0.2303) 0.32, (0.2426)
    CBSIwt 0.35***, (0.109) 0.17, (0.1303)
    MON_POLt 0.51***, (0.1203) 0.36**, (0.143)
    ΔGDPt 0.29**, (0.1465) 0.49***, (0.1241) 0.4***, (0.1428)
    ΔINFt -0.07, (0.1131) -0.08, (0.1114) -0.08, (0.1118)
    -1/0 -1.3***, (0.1352) -1.43***, (0.1492) -1.44***, (0.1506) -1.4***, (0.1451) -1.37***, (0.1425) -1.41***, (0.1465)
    0/1 1.35***, (0.1355) 1.5***, (0.1542) 1.52***, (0.156) 1.46***, (0.1482) 1.42***, (0.1439) 1.47***, (0.1493)
    pseudo R^2 0.0478 0.1354 0.1532 0.1242 0.0962 0.1315
    LLR -107.95 -98.02 -96.01 -99.29 -102.46 -98.46
    FEDERAL RESERVE
    Δrt 1.05***, (0.1845) 1.07***, (0.1863) 0.96***, (0.192) 0.94***, (0.1903) 1.04***, (0.1859) 0.95***, (0.1913)
    CBSIwt 0.2**, (0.086) 0.16*, (0.0919)
    MON_POLt 0.19**, (0.0856) 0.16*, (0.0875)
    ΔGDPt 0.25**, (0.1011) 0.28***, (0.0995) 0.25**, (0.1023)
    ΔINFt -0.05, (0.0948) -0.05, (0.0945) -0.09, (0.0966)
    -1/0 -1.4***, (0.1237) -1.43***, (0.1252) -1.48***, (0.1314) -1.46***, (0.1307) -1.43***, (0.1248) -1.47***, (0.1313)
    0/1 1.32***, (0.1161) 1.35***, (0.1202) 1.39***, (0.1236) 1.36***, (0.1208) 1.35***, (0.1209) 1.38***, (0.1235)
    pseudo R^2 0.1064 0.1225 0.1421 0.1322 0.1235 0.1414
    LLR -145.17 -142.55 -139.37 -140.98 -142.39 -139.48
    The model is estimated by maximum likelihood. Robust standard errors are in parenthesis. Coefficients labelled with ***, **, * denote significance at 1%, 5%, 10% levels, respectively. ELB stands for Effective Lower Bound. The number of observations is 112,174 and 238 for the BoE, the ECB and the Fed, respectively.

     | Show Table
    DownLoad: CSV
    Table 14.  Average marginal effects excluding the ELB.
    Coefficients with ***, **, * are significant at 1%, 5%, 10% levels
    BoE (ex-ELB) P(Rate Cut) P(No Change) P(Rate Hike)
    (Ⅰ)
    Δrt -0.216*** 0.056 0.160***
    (Ⅱ)
    Δrt -0.145*** 0.035 0.109***
    MON_POLt -0.103*** 0.025 0.077***
    (Ⅲ)
    Δrt -0.117** 0.025 0.092**
    MON_POLt -0.107*** 0.023 0.085***
    ΔGDPt -0.069* 0.015 0.055*
    ΔINFt -0.061* 0.013 0.048*
    ECB (ex-ELB) P(Rate Cut) P(No Change) P(Rate Hike)
    (Ⅰ)
    Δrt -0.122*** 0.016 0.106***
    (Ⅱ)
    Δrt -0.058 0.009 0.049
    MON_POLt -0.083*** 0.013 0.071***
    (Ⅲ)
    Δrt -0.037 0.005 0.032
    MON_POLt -0.061*** 0.008 0.053***
    ΔGDPt -0.041** 0.005 0.036**
    ΔINFt 0.005 -0.001 -0.004
    Fed (ex-ELB) P(Rate Cut) P(No Change) P(Rate Hike)
    (Ⅰ)
    Δrt -0.164*** -0.026 0.190***
    (Ⅱ)
    Δrt -0.165*** -0.022 0.186***
    MON_POLt -0.030** -0.004 0.034**
    (Ⅲ)
    Δrt -0.141*** -0.025 0.166***
    MON_POLt -0.023* -0.004 0.027*
    ΔGDPt -0.037** -0.006 0.043**
    ΔINFt 0.007 0.001 -0.009

     | Show Table
    DownLoad: CSV


    [1] Apel M, Grimaldi MB (2014) How informative are central bank minutes? Review Econ 65: 53-76.https://doi.org/10.1515/roe-2014-0104 doi: 10.1515/roe-2014-0104
    [2] Armelius H, Bertsch C, Hull I, et al. (2020) Spread the word: International spillovers from central bank communication. J Int Money Financ 103: 102116.https://doi.org/10.1016/j.jimonfin.2019.102116 doi: 10.1016/j.jimonfin.2019.102116
    [3] Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval, 463. ACM press New York.
    [4] Benamar H, Foucault T, Vega C (2021) Demand for information, uncertainty, and the response of us treasury securities to news. Rev Financ Stud 347: 3403-3455. https://doi.org/10.1093/rfs/hhaa072 doi: 10.1093/rfs/hhaa072
    [5] Bennani H, Fanta N, Gertler P, et al. (2020) Does central bank communication signal future monetary policy in a (post)-crisis era? the case of the ecb. J Int Money Financ 104: 102167. https://doi.org/10.1016/j.jimonfin.2020.102167 doi: 10.1016/j.jimonfin.2020.102167
    [6] Birz G, Lott Jr JR (2011) The effect of macroeconomic news on stock returns: New evidence from newspaper coverage. J Bank Finance 35: 2791-2800. https://doi.org/10.1016/j.jbankfin.2011.03.006 doi: 10.1016/j.jbankfin.2011.03.006
    [7] Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3: 993-1022.
    [8] Carboni M, Farina V, Previati DA(2020) Ecb and fed governors' speeches: A topic modeling analysis (2007-2019). In Banking and Beyond, Springer, 9-25.
    [9] Correa R, Garud K, Londono JM, et al. (2017) Constructing a dictionary for financial stability. Board Governors Fed Reserve System, Washington, DC. https://doi.org/10.17016/2573-2129.33
    [10] Cour-Thimann P (2020) Interest rate setting and communication at the ecb. http://dx.doi.org/10.2139/ssrn.3654147
    [11] Dincer NN, Eichengreen B (2013) Central bank transparency and independence: updates and new measures. http://dx.doi.org/10.2139/ssrn.2579544
    [12] Gelman A, Hill J (2006) Data analysis using regression and multilevel/hierarchical models. Cambridge university press.
    [13] Griffiths TL, Steyvers M (2004) Finding scientific topics. P NATL A SCI INDIA B 101: 5228-5235. https://doi.org/10.1073/pnas.030775210 doi: 10.1073/pnas.030775210
    [14] Grün B, Hornik K (2011) topicmodels: An r package for fitting topic models. J Stat Software 40: 1-30. http://www.jstatsoft.org/v40/i13
    [15] Hansen S, McMahon M, Prat A (2018) Transparency and deliberation within the fomc: a computational linguistics approach. Q J Econ 133: 801-870. https://doi.org/10.1093/qje/qjx045 doi: 10.1093/qje/qjx045
    [16] Hayo B, Neuenkirch M (2010) Do federal reserve communications help predict federal funds target rate decisions? J Macroecon 32: 1014-1024. https://doi.org/10.1016/j.jmacro.2010.06.003 doi: 10.1016/j.jmacro.2010.06.003
    [17] Kalman RE (1960). A new approach to linear filtering and prediction problems" transaction of the asme journal of basic. https://doi.org/10.1115/1.3662552
    [18] Keida M, Takeda Y (2018) The arts of central bank communication: A topic analysis on words of the bank of japan s governors. AEA Annual Conference.
    [19] Leskovec J, Rajaraman A, Ullman JD (2020). Mining of massive data sets. Cambridge university press.
    [20] Logsdon BA, Hoffman GE, Mezey JG(2010) A variational bayes algorithm for fast and accurate multiple locus genome-wide association analysis. BMC Bioinformatics 11: 1-13. https://doi.org/10.1186/1471-2105-11-58 doi: 10.1186/1471-2105-11-58
    [21] Loughran T, McDonald B (2011). When is a liability not a liability? textual analysis, dictionaries, and 10-ks. J Financ 66: 35-65. https://doi.org/10.1111/j.1540-6261.2010.01625.x doi: 10.1111/j.1540-6261.2010.01625.x
    [22] McFadden D (1974) Conditional logit analysis of qualitative choice behavior. Front Economet 50: 105-142.
    [23] Navarro DJ, Griffiths TL, Steyvers M, et al. (2006) Modeling individual differences using dirichlet processes. J Math Psychol 50: 101-122. https://doi.org/10.1016/j.jmp.2005.11.006 doi: 10.1016/j.jmp.2005.11.006
    [24] Picault M, Renault T (2017) Words are not all created equal: A new measure of ecb communication. J Int Money Financ 79: 136-156. https://doi.org/10.1016/j.jimonfin.2017.09.005 doi: 10.1016/j.jimonfin.2017.09.005
    [25] Priola MP, Lorenzini P, Tizzanini G, et al. (2021) Measuring central banks' sentiment and its spillover effects with a network approach. http://dx.doi.org/10.2139/ssrn.3764004
    [26] Röder M, Both A, Hinneburg A (2015) Exploring the space of topic coherence measures. In Proceedings of the eighth ACM international conference on Web search and data mining 399-408. https://doi.org/10.1145/2684822.2685324
    [27] Rosa C (2009) Forecasting the direction of policy rate changes: The importance of ecb words. Econ Notes 38, 39-66. https://doi.org/10.1111/j.1468-0300.2009.00209.x doi: 10.1111/j.1468-0300.2009.00209.x
    [28] Shapiro AH, Sudhof M, Wilson DJ (2020). Measuring news sentiment. J Econom. https://doi.org/10.1016/j.jeconom.2020.07.053 doi: 10.1016/j.jeconom.2020.07.053
    [29] Silge J, Robinson D (2017) Text mining with R: A tidy approach. "O'Reilly Media, Inc.".
    [30] Silva JS, Cardoso F (2001) The chow-lin method using dynamic models. Econ model 18: 269-280. https://doi.org/10.1016/S0264-9993(00)00039-0 doi: 10.1016/S0264-9993(00)00039-0
    [31] Swanson ET, Williams JC (2014) Measuring the effect of the zero lower bound on medium-and longer-term interest rates. Am Econ Rev 104: 3154-3185. https://doi.org/10.1257/aer.104.10.3154 doi: 10.1257/aer.104.10.3154
    [32] Tumala MM, Omotosho BS (2019) A text mining analysis of central bank monetary policy communication in nigeria. CBN J Appl Stat 10. http://dx.doi.org/10.2139/ssrn.3545508 doi: 10.2139/ssrn.3545508
    [33] Witmer J, Yang J (2015) Estimating canada's effective lower bound. Technical report, Bank of Canada.
    [34] Wu JC, Xia FD (2016). Measuring the macroeconomic impact of monetary policy at the zero lower bound. J Money, Credit Banking 48: 253-291. https://doi.org/10.1111/jmcb.12300 doi: 10.1111/jmcb.12300
    [35] Zipf G (1936) The psycho-biology of language, routledge, london. https://doi.org/10.2307/408930
  • DSFE-02-03-009-s001.pdf
  • This article has been cited by:

    1. Wenjie Li, Zimei Huang, Do different stock indices volatility respond differently to Central bank digital currency signals?, 2023, 31, 2688-1594, 5573, 10.3934/era.2023283
    2. Christian Admin De la Huerta Avila, The Predictive Power of Central Bank Communication: Evidence from Mexico, 2023, 1556-5068, 10.2139/ssrn.4595144
  • Reader Comments
  • © 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(3311) PDF downloads(257) Cited by(2)

Figures and Tables

Figures(18)  /  Tables(14)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog