How informative is the text of securities complaints?

Adam B Badawi
{"title":"How informative is the text of securities complaints?","authors":"Adam B Badawi","doi":"10.1093/jleo/ewad003","DOIUrl":null,"url":null,"abstract":"Abstract Much of the research in law and finance reduces complex texts down to a handful of variables. Legal scholars have voiced concerns that this dimensionality reduction loses important detail that is embedded in legal text. This article assesses this critique by asking whether text analysis can capture meaningful predictive information. It does so by applying text analysis and machine learning to a corpus of private securities class action complaints that contains over 90 million words. This analysis produces three primary findings: (1) the best performing models predict outcomes with an accuracy rate of about 70%, which is higher than baseline rates; (2) a hybrid model that uses both text and nontext components performs better than either of these two components alone; and (3) the predictions made by the machine learning models are associated with substantial abnormal returns in the days after cases get filed.","PeriodicalId":485552,"journal":{"name":"The journal of law, economics, & organization","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The journal of law, economics, & organization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jleo/ewad003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Much of the research in law and finance reduces complex texts down to a handful of variables. Legal scholars have voiced concerns that this dimensionality reduction loses important detail that is embedded in legal text. This article assesses this critique by asking whether text analysis can capture meaningful predictive information. It does so by applying text analysis and machine learning to a corpus of private securities class action complaints that contains over 90 million words. This analysis produces three primary findings: (1) the best performing models predict outcomes with an accuracy rate of about 70%, which is higher than baseline rates; (2) a hybrid model that uses both text and nontext components performs better than either of these two components alone; and (3) the predictions made by the machine learning models are associated with substantial abnormal returns in the days after cases get filed.
证券投诉书内容的信息量有多大?
许多法律和金融研究将复杂的文本简化为几个变量。法律学者担心这种降维会失去法律文本中包含的重要细节。本文通过询问文本分析是否可以捕获有意义的预测信息来评估这种批评。它通过将文本分析和机器学习应用于包含超过9000万字的私人证券集体诉讼投诉语料库来实现这一目标。该分析产生了三个主要发现:(1)表现最好的模型预测结果的准确率约为70%,高于基线率;(2)同时使用文本和非文本组件的混合模型比单独使用这两种组件中的任何一种都更好;(3)机器学习模型做出的预测与案件提交后几天内的大量异常回报有关。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信