基于机器学习的企业污染物水平虚报概率评估——以危险废物产生量为例

IF 11.2 1区 环境科学与生态学 Q1 ENGINEERING, ENVIRONMENTAL
Wen Fang , Qingyuan Yu , Wenjun Xie , Jianxun Yang , Zongwei Ma , Miaomiao Liu , Jun Bi
{"title":"基于机器学习的企业污染物水平虚报概率评估——以危险废物产生量为例","authors":"Wen Fang ,&nbsp;Qingyuan Yu ,&nbsp;Wenjun Xie ,&nbsp;Jianxun Yang ,&nbsp;Zongwei Ma ,&nbsp;Miaomiao Liu ,&nbsp;Jun Bi","doi":"10.1016/j.resconrec.2025.108407","DOIUrl":null,"url":null,"abstract":"<div><div>Industrial enterprises are prominent sources of contaminants, and accurate reporting of contaminant discharge by these enterprises is essential for effective environmental management. However, fraudulent reporting, particularly underreporting hazardous waste (HW) generation, has become an intractable issue. HW is difficult to monitor automatically, and underreporting often implies improper treatment or illegal dumping, which can cause severe environmental risks. Regulators face challenges in detecting underreporting due to the lack of prior knowledge about the theoretical HW generation at the firm level. This study proposes a machine learning framework to detect fraudulent reporting by correlating HW generation with enterprises’ static characteristics and real-time manufacturing activities, such as wastewater discharge. The model was tested on two industrial sectors in Jiangsu, China, with datasets containing 10 % fraudulent samples. It achieved an accuracy of 42.9 %-55 % when observations with a probability of underreporting HW generation exceeding 90 % were flagged as fraudulent.</div></div>","PeriodicalId":21153,"journal":{"name":"Resources Conservation and Recycling","volume":"221 ","pages":"Article 108407"},"PeriodicalIF":11.2000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Probabilistic evaluation of enterprises' fraudulent report in pollutant levels based on machine learning: a case of hazardous waste generation quantity\",\"authors\":\"Wen Fang ,&nbsp;Qingyuan Yu ,&nbsp;Wenjun Xie ,&nbsp;Jianxun Yang ,&nbsp;Zongwei Ma ,&nbsp;Miaomiao Liu ,&nbsp;Jun Bi\",\"doi\":\"10.1016/j.resconrec.2025.108407\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Industrial enterprises are prominent sources of contaminants, and accurate reporting of contaminant discharge by these enterprises is essential for effective environmental management. However, fraudulent reporting, particularly underreporting hazardous waste (HW) generation, has become an intractable issue. HW is difficult to monitor automatically, and underreporting often implies improper treatment or illegal dumping, which can cause severe environmental risks. Regulators face challenges in detecting underreporting due to the lack of prior knowledge about the theoretical HW generation at the firm level. This study proposes a machine learning framework to detect fraudulent reporting by correlating HW generation with enterprises’ static characteristics and real-time manufacturing activities, such as wastewater discharge. The model was tested on two industrial sectors in Jiangsu, China, with datasets containing 10 % fraudulent samples. It achieved an accuracy of 42.9 %-55 % when observations with a probability of underreporting HW generation exceeding 90 % were flagged as fraudulent.</div></div>\",\"PeriodicalId\":21153,\"journal\":{\"name\":\"Resources Conservation and Recycling\",\"volume\":\"221 \",\"pages\":\"Article 108407\"},\"PeriodicalIF\":11.2000,\"publicationDate\":\"2025-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Resources Conservation and Recycling\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S092134492500285X\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Resources Conservation and Recycling","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S092134492500285X","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

工业企业是污染物的主要来源,准确报告这些企业的污染物排放对有效的环境管理至关重要。然而,虚假报告,特别是少报有害废物(HW)的产生,已成为一个棘手的问题。HW很难自动监测,少报往往意味着不当处理或非法倾倒,这可能造成严重的环境风险。由于缺乏对公司层面的理论HW产生的先验知识,监管机构在检测漏报方面面临挑战。本研究提出了一个机器学习框架,通过将HW的产生与企业的静态特征和实时制造活动(如废水排放)相关联,来检测欺诈报告。该模型在中国江苏的两个工业部门进行了测试,数据集包含10%的欺诈样本。当低报HW产生概率超过90%的观测结果被标记为欺诈时,它的准确性达到42.9% - 55%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Probabilistic evaluation of enterprises' fraudulent report in pollutant levels based on machine learning: a case of hazardous waste generation quantity
Industrial enterprises are prominent sources of contaminants, and accurate reporting of contaminant discharge by these enterprises is essential for effective environmental management. However, fraudulent reporting, particularly underreporting hazardous waste (HW) generation, has become an intractable issue. HW is difficult to monitor automatically, and underreporting often implies improper treatment or illegal dumping, which can cause severe environmental risks. Regulators face challenges in detecting underreporting due to the lack of prior knowledge about the theoretical HW generation at the firm level. This study proposes a machine learning framework to detect fraudulent reporting by correlating HW generation with enterprises’ static characteristics and real-time manufacturing activities, such as wastewater discharge. The model was tested on two industrial sectors in Jiangsu, China, with datasets containing 10 % fraudulent samples. It achieved an accuracy of 42.9 %-55 % when observations with a probability of underreporting HW generation exceeding 90 % were flagged as fraudulent.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Resources Conservation and Recycling
Resources Conservation and Recycling 环境科学-工程:环境
CiteScore
22.90
自引率
6.10%
发文量
625
审稿时长
23 days
期刊介绍: The journal Resources, Conservation & Recycling welcomes contributions from research, which consider sustainable management and conservation of resources. The journal prioritizes understanding the transformation processes crucial for transitioning toward more sustainable production and consumption systems. It highlights technological, economic, institutional, and policy aspects related to specific resource management practices such as conservation, recycling, and resource substitution, as well as broader strategies like improving resource productivity and restructuring production and consumption patterns. Contributions may address regional, national, or international scales and can range from individual resources or technologies to entire sectors or systems. Authors are encouraged to explore scientific and methodological issues alongside practical, environmental, and economic implications. However, manuscripts focusing solely on laboratory experiments without discussing their broader implications will not be considered for publication in the journal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信