Wen Fang , Qingyuan Yu , Wenjun Xie , Jianxun Yang , Zongwei Ma , Miaomiao Liu , Jun Bi
{"title":"基于机器学习的企业污染物水平虚报概率评估——以危险废物产生量为例","authors":"Wen Fang , Qingyuan Yu , Wenjun Xie , Jianxun Yang , Zongwei Ma , Miaomiao Liu , Jun Bi","doi":"10.1016/j.resconrec.2025.108407","DOIUrl":null,"url":null,"abstract":"<div><div>Industrial enterprises are prominent sources of contaminants, and accurate reporting of contaminant discharge by these enterprises is essential for effective environmental management. However, fraudulent reporting, particularly underreporting hazardous waste (HW) generation, has become an intractable issue. HW is difficult to monitor automatically, and underreporting often implies improper treatment or illegal dumping, which can cause severe environmental risks. Regulators face challenges in detecting underreporting due to the lack of prior knowledge about the theoretical HW generation at the firm level. This study proposes a machine learning framework to detect fraudulent reporting by correlating HW generation with enterprises’ static characteristics and real-time manufacturing activities, such as wastewater discharge. The model was tested on two industrial sectors in Jiangsu, China, with datasets containing 10 % fraudulent samples. It achieved an accuracy of 42.9 %-55 % when observations with a probability of underreporting HW generation exceeding 90 % were flagged as fraudulent.</div></div>","PeriodicalId":21153,"journal":{"name":"Resources Conservation and Recycling","volume":"221 ","pages":"Article 108407"},"PeriodicalIF":11.2000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Probabilistic evaluation of enterprises' fraudulent report in pollutant levels based on machine learning: a case of hazardous waste generation quantity\",\"authors\":\"Wen Fang , Qingyuan Yu , Wenjun Xie , Jianxun Yang , Zongwei Ma , Miaomiao Liu , Jun Bi\",\"doi\":\"10.1016/j.resconrec.2025.108407\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Industrial enterprises are prominent sources of contaminants, and accurate reporting of contaminant discharge by these enterprises is essential for effective environmental management. However, fraudulent reporting, particularly underreporting hazardous waste (HW) generation, has become an intractable issue. HW is difficult to monitor automatically, and underreporting often implies improper treatment or illegal dumping, which can cause severe environmental risks. Regulators face challenges in detecting underreporting due to the lack of prior knowledge about the theoretical HW generation at the firm level. This study proposes a machine learning framework to detect fraudulent reporting by correlating HW generation with enterprises’ static characteristics and real-time manufacturing activities, such as wastewater discharge. The model was tested on two industrial sectors in Jiangsu, China, with datasets containing 10 % fraudulent samples. It achieved an accuracy of 42.9 %-55 % when observations with a probability of underreporting HW generation exceeding 90 % were flagged as fraudulent.</div></div>\",\"PeriodicalId\":21153,\"journal\":{\"name\":\"Resources Conservation and Recycling\",\"volume\":\"221 \",\"pages\":\"Article 108407\"},\"PeriodicalIF\":11.2000,\"publicationDate\":\"2025-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Resources Conservation and Recycling\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S092134492500285X\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Resources Conservation and Recycling","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S092134492500285X","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
Probabilistic evaluation of enterprises' fraudulent report in pollutant levels based on machine learning: a case of hazardous waste generation quantity
Industrial enterprises are prominent sources of contaminants, and accurate reporting of contaminant discharge by these enterprises is essential for effective environmental management. However, fraudulent reporting, particularly underreporting hazardous waste (HW) generation, has become an intractable issue. HW is difficult to monitor automatically, and underreporting often implies improper treatment or illegal dumping, which can cause severe environmental risks. Regulators face challenges in detecting underreporting due to the lack of prior knowledge about the theoretical HW generation at the firm level. This study proposes a machine learning framework to detect fraudulent reporting by correlating HW generation with enterprises’ static characteristics and real-time manufacturing activities, such as wastewater discharge. The model was tested on two industrial sectors in Jiangsu, China, with datasets containing 10 % fraudulent samples. It achieved an accuracy of 42.9 %-55 % when observations with a probability of underreporting HW generation exceeding 90 % were flagged as fraudulent.
期刊介绍:
The journal Resources, Conservation & Recycling welcomes contributions from research, which consider sustainable management and conservation of resources. The journal prioritizes understanding the transformation processes crucial for transitioning toward more sustainable production and consumption systems. It highlights technological, economic, institutional, and policy aspects related to specific resource management practices such as conservation, recycling, and resource substitution, as well as broader strategies like improving resource productivity and restructuring production and consumption patterns.
Contributions may address regional, national, or international scales and can range from individual resources or technologies to entire sectors or systems. Authors are encouraged to explore scientific and methodological issues alongside practical, environmental, and economic implications. However, manuscripts focusing solely on laboratory experiments without discussing their broader implications will not be considered for publication in the journal.