The Linguistic Properties of Award-winning Annual Reports

Jacqueline Gagnon, S. Young, P. Alves
{"title":"The Linguistic Properties of Award-winning Annual Reports","authors":"Jacqueline Gagnon, S. Young, P. Alves","doi":"10.2139/ssrn.3575679","DOIUrl":null,"url":null,"abstract":"We develop and test a model of high quality annual report discourse. The model is trained and evaluated on reports published between 2007 and 2018 by London Stock Exchange-listed firms shortlisted for an award by corporate reporting experts. We use methods from computational linguistics to identify an initial set of 19 features that distinguish quality according to what management say (i.e.: content) and how they say it (i.e.: language structure). We supplement these features with popular bag-of words proxies drawn from extant research (document length, reading ease, net tone, forward-looking content, and uncertainty). Stepwise regression yields a parsimonious quality model comprising 10 features that suggest more strategy-related commentary, less focus on growth, and greater language accessibility that promotes cognitive processing (evidenced by more relevancy markers, greater connectivity, more exclusive forms of language, and fewer grammatical words). The model predicts over 70% of shortlisting cases in out-of-sample tests and outperforms a baseline model comprising popular bag-of-words features.","PeriodicalId":202880,"journal":{"name":"Research Methods & Methodology in Accounting eJournal","volume":"25 7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research Methods & Methodology in Accounting eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3575679","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We develop and test a model of high quality annual report discourse. The model is trained and evaluated on reports published between 2007 and 2018 by London Stock Exchange-listed firms shortlisted for an award by corporate reporting experts. We use methods from computational linguistics to identify an initial set of 19 features that distinguish quality according to what management say (i.e.: content) and how they say it (i.e.: language structure). We supplement these features with popular bag-of words proxies drawn from extant research (document length, reading ease, net tone, forward-looking content, and uncertainty). Stepwise regression yields a parsimonious quality model comprising 10 features that suggest more strategy-related commentary, less focus on growth, and greater language accessibility that promotes cognitive processing (evidenced by more relevancy markers, greater connectivity, more exclusive forms of language, and fewer grammatical words). The model predicts over 70% of shortlisting cases in out-of-sample tests and outperforms a baseline model comprising popular bag-of-words features.
获奖年度报告的语言特性
我们开发并检验了一个高质量年度报告话语模型。该模型是根据伦敦证券交易所上市公司2007年至2018年间发布的报告进行培训和评估的,这些公司入围了企业报告专家的奖项。我们使用计算语言学的方法来识别19个特征的初始集合,这些特征根据管理层所说的内容(即:内容)和他们如何说(即:语言结构)来区分质量。我们从现有的研究(文档长度、阅读难易程度、网络语气、前瞻性内容和不确定性)中提取流行的词包代理来补充这些特征。逐步回归产生了一个包含10个特征的简约质量模型,这些特征表明更多的与策略相关的评论,更少的关注增长,以及促进认知处理的更大的语言可访问性(通过更多的相关性标记,更大的连通性,更独特的语言形式和更少的语法单词来证明)。该模型在样本外测试中预测了超过70%的入围案例,并且优于包含流行词袋特征的基线模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信