Identifying environmental information disclosure manipulation behavior via machine learning

IF 4.7 3区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES
Xiang Cai, Jia-jun Wan, Ying-Ying Jiang, Nan Zhou, Lei Wang, Chen-Meng Wu, Ye Tian
{"title":"Identifying environmental information disclosure manipulation behavior via machine learning","authors":"Xiang Cai, Jia-jun Wan, Ying-Ying Jiang, Nan Zhou, Lei Wang, Chen-Meng Wu, Ye Tian","doi":"10.1007/s10668-024-05340-8","DOIUrl":null,"url":null,"abstract":"<p>Corporate environmental information disclosure manipulation (EIDM) has a high level of concealment, which brings great challenges to the identification and judgment of manipulation behavior. Compared to traditional methods, machine learning techniques excel in handling large and complex datasets while achieving higher accuracy. This research applies machine learning techniques to construct the identification model of EIDM behavior and carry out the identification research of EIDM behavior. Based on the “public pressure” theory, the detection indicators will be improved from three aspects: public pressure, corporate governance, and financial indicators. By combining the collected environmental pollution penalty cases of Chinese listed companies from 2011 to 2020 with a pressure pool indicator system, we establish a training set and a test set to compare the identification ability of the logistic regression (LR), decision tree (DT), Support Vector Machine (SVM), Backpropagation (BP) Neural Network, and random forest (RF) models. Additionally, during the initial phase of model training, hyperparameter tuning is conducted across these models to ensure the maximization of their performance. For imbalanced data, after comparing the two oversampling techniques of the Borderline synthetic minority oversampling technique (Borderline SMOTE) and adaptive synthetic sampling (ADASYN), our study indicates that the Borderline SMOTE model has a better recognition effect than ADASYN and that the Borderline SMOTE-RF model is superior to the LR, DT, BP, and SVM models. We hope that our research can provide a reference for regulatory authorities, accelerate the improvement of the mandatory environmental information disclosure (EID) system of listed companies, improve the identification and early warning capabilities of EIDM, and promote the improvement of EID quality.</p>","PeriodicalId":540,"journal":{"name":"Environment, Development and Sustainability","volume":"8 1","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment, Development and Sustainability","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10668-024-05340-8","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Corporate environmental information disclosure manipulation (EIDM) has a high level of concealment, which brings great challenges to the identification and judgment of manipulation behavior. Compared to traditional methods, machine learning techniques excel in handling large and complex datasets while achieving higher accuracy. This research applies machine learning techniques to construct the identification model of EIDM behavior and carry out the identification research of EIDM behavior. Based on the “public pressure” theory, the detection indicators will be improved from three aspects: public pressure, corporate governance, and financial indicators. By combining the collected environmental pollution penalty cases of Chinese listed companies from 2011 to 2020 with a pressure pool indicator system, we establish a training set and a test set to compare the identification ability of the logistic regression (LR), decision tree (DT), Support Vector Machine (SVM), Backpropagation (BP) Neural Network, and random forest (RF) models. Additionally, during the initial phase of model training, hyperparameter tuning is conducted across these models to ensure the maximization of their performance. For imbalanced data, after comparing the two oversampling techniques of the Borderline synthetic minority oversampling technique (Borderline SMOTE) and adaptive synthetic sampling (ADASYN), our study indicates that the Borderline SMOTE model has a better recognition effect than ADASYN and that the Borderline SMOTE-RF model is superior to the LR, DT, BP, and SVM models. We hope that our research can provide a reference for regulatory authorities, accelerate the improvement of the mandatory environmental information disclosure (EID) system of listed companies, improve the identification and early warning capabilities of EIDM, and promote the improvement of EID quality.

Abstract Image

通过机器学习识别环境信息披露操纵行为
企业环境信息披露操纵(EIDM)具有高度的隐蔽性,这给操纵行为的识别和判断带来了巨大挑战。与传统方法相比,机器学习技术在处理大型复杂数据集方面表现出色,同时能获得更高的准确性。本研究应用机器学习技术构建了EIDM行为的识别模型,并开展了EIDM行为的识别研究。基于 "舆论压力 "理论,从舆论压力、公司治理、财务指标三个方面完善检测指标。结合收集到的 2011-2020 年中国上市公司环境污染处罚案例和压力池指标体系,建立训练集和测试集,比较逻辑回归模型(LR)、决策树模型(DT)、支持向量机模型(SVM)、反向传播神经网络模型(BP)和随机森林模型(RF)的识别能力。此外,在模型训练的初始阶段,对这些模型进行超参数调整,以确保其性能最大化。对于不平衡数据,在比较了边界线合成少数群体超采样技术(Borderline SMOTE)和自适应合成采样(ADASYN)这两种超采样技术后,我们的研究表明,边界线 SMOTE 模型的识别效果优于 ADASYN,而边界线 SMOTE-RF 模型则优于 LR、DT、BP 和 SVM 模型。希望我们的研究能为监管部门提供参考,加快完善上市公司强制性环境信息披露(EID)制度,提高EIDM的识别和预警能力,促进EID质量的提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environment, Development and Sustainability
Environment, Development and Sustainability Economics, Econometrics and Finance-Economics and Econometrics
CiteScore
10.20
自引率
6.10%
发文量
754
期刊介绍: Environment, Development and Sustainability is an international and multidisciplinary journal covering all aspects of the environmental impacts of socio-economic development. It is also concerned with the complex interactions which occur between development and environment, and its purpose is to seek ways and means for achieving sustainability in all human activities aimed at such development. The subject matter of the journal includes the following and related issues: -mutual interactions among society, development and environment, and their implications for sustainable development -technical, economic, ethical and philosophical aspects of sustainable development -global sustainability - the obstacles and ways in which they could be overcome -local and regional sustainability initiatives, their practical implementation, and relevance for use in a wider context -development and application of indicators of sustainability -development, verification, implementation and monitoring of policies for sustainable development -sustainable use of land, water, energy and biological resources in development -impacts of agriculture and forestry activities on soil and aquatic ecosystems and biodiversity -effects of energy use and global climate change on development and sustainability -impacts of population growth and human activities on food and other essential resources for development -role of national and international agencies, and of international aid and trade arrangements in sustainable development -social and cultural contexts of sustainable development -role of education and public awareness in sustainable development -role of political and economic instruments in sustainable development -shortcomings of sustainable development and its alternatives.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信