A2 Multi-class Classification

Our sample period runs from January 3, 2006 to December 7, 2024. The start date is the first date on which FOMC statements are available in HTML format on the Federal Reserve website; the end date is the day we began collecting the data. We gather all 48 FOMC statements regarding monetary policy during this period. We parse each statement and remove its last paragraph, which describes the voting results. We then tokenize the plain text into sentences and remove duplicates. This process yields 1,096 unique sentences comprising approximately 27,000 words.
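The preprocessing steps described above (drop the final voting paragraph, split into sentences, remove duplicates) can be sketched roughly as follows. This is only an illustration under stated assumptions: the source does not say which sentence tokenizer was used, so a naive regex splitter stands in for it, paragraphs are assumed to be separated by blank lines, and the function names and the two sample statements are hypothetical.

```python
import re


def drop_last_paragraph(text: str) -> str:
    """Remove the final paragraph (assumed to hold the voting results)."""
    # Assumption: paragraphs are separated by one or more blank lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    return "\n\n".join(paragraphs[:-1])


def split_sentences(text: str) -> list:
    """Naive sentence splitter; a stand-in for the unspecified tokenizer."""
    flat = " ".join(text.split())  # collapse line breaks and repeated spaces
    # Split after ., ! or ? when followed by whitespace and a capital letter.
    # This will mis-split abbreviations such as "U.S. Treasury".
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", flat)
    return [s.strip() for s in parts if s.strip()]


def unique_sentences(statements: list) -> list:
    """Collect sentences across statements, keeping first occurrences only."""
    seen = set()
    unique = []
    for statement in statements:
        for sentence in split_sentences(drop_last_paragraph(statement)):
            if sentence not in seen:
                seen.add(sentence)
                unique.append(sentence)
    return unique


# Hypothetical toy input, not actual FOMC text.
statements = [
    "The Committee decided to raise the target rate. "
    "Inflation remains elevated.\n\n"
    "Voting for the action were A and B.",
    "Inflation remains elevated. The labor market is strong.\n\n"
    "Voting for the action were A, B, and C.",
]

unique = unique_sentences(statements)
word_count = sum(len(sentence.split()) for sentence in unique)
print(len(unique), "unique sentences,", word_count, "words")
```

Exact string matching is used for deduplication here; whether the original procedure normalized case or whitespace before comparing sentences is not stated in the source.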
- Aguda T, Siddagangappa S, Kochkina E, Kaur S, Wang D, Smiley C, Shah S (2024) Large language models as financial data annotators: A study on effectiveness and efficiency.
- Ahmed H, Lofstead J (2022) Managing randomness to enable reproducible machine learning. Proceedings of the 5th International Workshop on Practical Reproducible Evaluation of Computer Systems P-RECS ’22. (Association for Computing Machinery, New York, NY, USA), 15–20.
Alonso-Robisco A, Carbó JM (2023) Analysis of CBDC narrative by central banks using large language models. Finance Research Letters 58:104643.
- Artstein R, Poesio M (2008) Inter-coder agreement for computational linguistics. Computational Linguistics 34(4):555–596.
- Bae J, Berger AN, Choi HS, Kim HH (2024) Bank sentiment and loan loss provisioning. (March 2) https://papers.ssrn.com/abstract=4745996.
- Bai J (Jianqiu), Boyson NM, Cao Y, Liu M, Wan C (2023) Executives vs. Chatbots: Unmasking insights through human-AI differences in earnings conference Q&A. (June 15) https://papers.ssrn.com/abstract=4480056.
- Bernard D, Blankespoor E, de Kok T, Toynbee S (2023) Confused readers: A modular measure of business complexity. (June 15) https://papers.ssrn.com/abstract=4480309.
- Blankespoor E, Croom J, Grant SM (2024) Generative AI and investor processing of financial information. (December 12) https://papers.ssrn.com/abstract=5053905.
- Bochkay K, Brown SV, Leone AJ, Tucker JW (2022) Textual analysis in accounting: What’s next? Contemporary Accounting Research (Forthcoming).
- Bond SA, Klok H, Zhu M (2025) Large language models and financial market sentiment. (January 26) https://papers.ssrn.com/abstract=4584928.
- Bozanic Z, Roulstone DT, Van Buskirk A (2018) Management earnings forecasts and other forward-looking statements. Journal of Accounting and Economics 65(1):1–20.
- Breitung C, Kruthof G, Müller S (2023) Contextualized sentiment analysis using large language models. (October 27) https://papers.ssrn.com/abstract=4615038.
- Brown SV, Hinson LA, Tucker JW (2024) Financial statement adequacy and firms’ MD&A disclosures. Contemporary Accounting Research 41(1):126–162.
Chen AY, Zimmermann T (2022) Open source cross-sectional asset pricing. Critical Finance Review 11(2):207–264.
Chen B, Wu Z, Zhao R (2023) From fiction to fact: the growing role of generative AI in business and finance. Journal of Chinese Economic and Business Studies 21(4):471–496.
- Chen J, Tang G, Zhou G, Zhu W (2023) ChatGPT, stock market predictability and links to the macroeconomy. (July 31) https://papers.ssrn.com/abstract=4660148.
- Comlekci İ, Unal S, Ozer A, Oncu MA (2023) Can AI technologies estimate financials accurately? A research on Borsa Istanbul with ChatGPT. (April 8) https://papers.ssrn.com/abstract=4545954.
- Dasgupta S, Li EXN, Wu S (2023) Inferring financial flexibility: Do actions speak louder than words? (June 17) https://papers.ssrn.com/abstract=4482620.
- Dong MM, Stratopoulos TC, Wang VX (2024) A scoping review of ChatGPT research in accounting and finance. International Journal of Accounting Information Systems 55:100715.
- de Kok T (2025) ChatGPT for textual analysis? How to use generative LLMs in accounting research. Management Science.
- Krippendorff K (2004) Reliability in content analysis: Some common misconceptions and recommendations. Human Communication Research 30(3):411–433.
- Efron B, Tibshirani RJ (1994) An introduction to the bootstrap (Chapman and Hall/CRC, New York).
- Fleiss JL (1971) Measuring nominal scale agreement among many raters. Psychological Bulletin 76(5):378–382.
- Gilardi F, Alizadeh M, Kubli M (2023) ChatGPT outperforms crowd-workers for text-annotation tasks. Proc. Natl. Acad. Sci. U.S.A. 120(30):e2305016120.
Glasserman P, Lin C (2023) Assessing look-ahead bias in stock return predictions generated by GPT sentiment analysis. (September 28) https://papers.ssrn.com/abstract=4586726.
- Gow ID (2025) The Elephant in the Room: p-hacking and Accounting Research. Accounting, Economics, and Law: A Convivium 15(1):81–98.
- Gundersen OE, Kjensmo S (2018) State of the art: Reproducibility in artificial intelligence. Proceedings of the AAAI Conference on Artificial Intelligence 32(1).
Hail L, Lang M, Leuz C (2020) Reproducibility in accounting research: Views of the research community. Journal of Accounting Research 58(2):519–543.
- Hammersley J (2013) Monte Carlo methods (Springer Science & Business Media).
- Hansen AL, Kazinnik S (2024) Can ChatGPT decipher fedspeak? (April 10) https://papers.ssrn.com/abstract=4399406.
Harvey CR (2019) Replication in financial economics. (July 29) https://papers.ssrn.com/abstract=3409466.
Hou K, Xue C, Zhang L (2020) Replicating anomalies. The Review of Financial Studies 33(5):2019–2133.
- Hu N, Liang P, Yang X (2023) Whetting all your appetites for financial tasks with one meal from GPT? A comparison of GPT, FinBERT, and dictionaries in evaluating sentiment analysis. (July 26) https://papers.ssrn.com/abstract=4426455.
- Huang AH, Wang H, Yang Y (2023) FinBERT: A large language model for extracting information from financial text. Contemporary Accounting Research 40(2):806–841.
- Jensen TI, Kelly B, Pedersen LH (2023) Is there a replication crisis in finance? The Journal of Finance 78(5):2465–2518.
- Jia N, Li N, Ma G, Xu D (2024) Corporate responses to generative AI: Early evidence from conference calls. (February 23) https://papers.ssrn.com/abstract=4736295.
Kim AG, Muhn M, Nikolaev VV (2024) Financial statement analysis with large language models. (November 7) https://papers.ssrn.com/abstract=4835311.
Kirtac K, Germano G (2024) Sentiment trading with large language models. (March 21) https://papers.ssrn.com/abstract=4706629.
- Krippendorff K (2011) Computing Krippendorff’s alpha-reliability.
- Kuroki Y, Manabe T, Nakagawa K (2023) Fact or opinion? – Essential value for financial results briefing. (April 27) https://papers.ssrn.com/abstract=4430511.
Leippold M (2023) Sentiment spin: Attacking financial sentiment with GPT-3. Finance Research Letters 55:103957.
- Levy B (2024) Caution ahead: Numerical reasoning and look-ahead bias in AI models. (December 25) https://papers.ssrn.com/abstract=5082861.
Li EX, Tu Z, Zhou D (2024) The promise and peril of generative AI: Evidence from GPT-4 as sell-side analysts. (December 1) https://papers.ssrn.com/abstract=4480947.
Li F (2008) Annual report readability, current earnings, and earnings persistence. Journal of Accounting and Economics 45(2):221–247.
Li F (2010) The information content of forward-looking statements in corporate filings—A naïve Bayesian machine learning approach. Journal of Accounting Research 48(5):1049–1102.
- Li T, Peng Q, Yu L (2023) ESG considerations in acquisitions and divestitures: Corporate responses to mandatory ESG disclosure. (May 26) https://papers.ssrn.com/abstract=4376676.
Linnainmaa JT, Roberts MR (2018) The history of the cross-section of stock returns. The Review of Financial Studies 31(7):2606–2649.
Lopez-Lira A, Tang Y (2024) Can ChatGPT forecast stock price movements? Return predictability and large language models. (April 14) https://papers.ssrn.com/abstract=4412788.
Loughran T, McDonald B (2011) When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. The Journal of Finance 66(1):35–65.
Loughran T, McDonald B (2014) Measuring readability in financial disclosures. The Journal of Finance 69(4):1643–1671.
Álvarez-Díez S, Baixauli-Soler JS, Kondratenko A, Lozano-Reina G (2024) Dividend announcement and the value of sentiment analysis. Journal of Management Analytics 0(0):1–21.
Malo P, Sinha A, Takala P, Korhonen P, Wallenius J (2013) Good debt or bad debt: Detecting semantic orientations in economic texts. (July 23) http://arxiv.org/abs/1307.5336.
- McHugh ML (2012) Interrater reliability: The Kappa statistic. Biochem Med (Zagreb) 22(3):276–282.
- Menkveld AJ, Dreber A, Holzmeister F, Huber J, Johannesson M, Kirchler M, Neusüß S, et al. (2024) Nonstandard errors. The Journal of Finance 79(3):2339–2390.
- Muslu V, Radhakrishnan S, Subramanyam KR, Lim D (2014) Forward-looking MD&A disclosures and the information environment. Management Science 61(5):931–948.
Pérignon C, Akmansoy O, Hurlin C, Dreber A, Holzmeister F, Huber J, Johannesson M, et al. (2024) Computational reproducibility in finance: Evidence from 1,000 tests. The Review of Financial Studies 37(11):3558–3593.
- Pineau J, Vincent-Lamarre P, Sinha K, Lariviere V, Beygelzimer A, d’Alche-Buc F, Fox E, Larochelle H (2021) Improving reproducibility in machine learning research. Journal of Machine Learning Research 22(164):1–20.
- Raff E (2019) A step toward quantifying independently reproducible machine learning research.
- Reiss MV (2023) Testing the reliability of ChatGPT for text annotation and classification: A cautionary remark. (April 17) http://arxiv.org/abs/2304.11085.
- Rossi L, Harrison K, Shklovski I (2024) The problems of LLM-generated data in social science research. Sociologica 18(2):145–168.
- Sarkar SK, Vafa K (2024) Lookahead bias in pretrained language models. (June 28) https://papers.ssrn.com/abstract=4754678.
- Shaffer M, Wang CCY (2024) Scaling core earnings measurement with large language models. (October 8) https://papers.ssrn.com/abstract=4979501.
- Shao J, Tu D (2012) The Jackknife and bootstrap (Springer Science & Business Media).
Smales LA (2023) Classification of RBA monetary policy announcements using ChatGPT. Finance Research Letters 58:104514.
- Sturua S, Mohr I, Akram MK, Günther M, Wang B, Krimmel M, Wang F, et al. (2024) jina-embeddings-v3: Multilingual embeddings with task LoRA. (September 19) http://arxiv.org/abs/2409.10173.
- Törnberg P (2023) ChatGPT-4 outperforms experts and crowd workers in annotating political twitter messages with zero-shot learning. (April 13) http://arxiv.org/abs/2304.06588.
Wang J, Wang VX (2024) Leveraging large language models to democratize access to costly financial datasets for academic research. (November 1) https://papers.ssrn.com/abstract=5012660.
- Wu Z, Dong Y, Li Y, Shi B (2023) Unleashing the power of text for credit default prediction: Comparing human-generated and AI-generated texts. (June 6) https://papers.ssrn.com/abstract=4601317.