Michele A. Brandão, Arthur P. G. Reis, Bárbara M. A. Mendes, Clara A. Bacha de Almeida, Gabriel P. Oliveira, Henrique Hott, Larissa D. Gomide, Lucas L. Costa, Mariana O. Silva, Anisio Lacerda, Gisele L. Pappa
{"title":"PLUS: A Semi-automated Pipeline for Fraud Detection in Public Bids","authors":"Michele A. Brandão, Arthur P. G. Reis, Bárbara M. A. Mendes, Clara A. Bacha de Almeida, Gabriel P. Oliveira, Henrique Hott, Larissa D. Gomide, Lucas L. Costa, Mariana O. Silva, Anisio Lacerda, Gisele L. Pappa","doi":"10.1145/3616396","DOIUrl":null,"url":null,"abstract":"The diversity of sources and formats of public bidding documents makes collecting, processing, and organizing such documents challenging from the point of view of data analysis. Thus, the development of approaches to deal with such data is relevant since the analysis of them allows to expand of the inclusion of people as they have more access to public decisions and expenditures, increase transparency in the public sector and give citizens a greater sense of responsibility for having different points of view on the government’s performance in meeting its public policy goals. In this context, we propose PLUS, a semi-automated pipeline for fraud detection in public bids. PLUS comprises a heuristic meta-classifier for bidding documents and a data quality module. Both modules present promising results after a proof of concept, reinforcing the relevance of PLUS for automating the bidding process investigation. Then, we present two applications of PLUS on real-world data: the construction of audit trails for fraud detection and a price database for overpricing detection. Such applications evidence a significant reduction of specialists’ work searching for irregularities in public bids.","PeriodicalId":93488,"journal":{"name":"Proceedings of the ... International Conference on Digital Government Research. International Conference on Digital Government Research","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Conference on Digital Government Research. International Conference on Digital Government Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3616396","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The diversity of sources and formats of public bidding documents makes collecting, processing, and organizing such documents challenging from the point of view of data analysis. Thus, the development of approaches to deal with such data is relevant since the analysis of them allows to expand of the inclusion of people as they have more access to public decisions and expenditures, increase transparency in the public sector and give citizens a greater sense of responsibility for having different points of view on the government’s performance in meeting its public policy goals. In this context, we propose PLUS, a semi-automated pipeline for fraud detection in public bids. PLUS comprises a heuristic meta-classifier for bidding documents and a data quality module. Both modules present promising results after a proof of concept, reinforcing the relevance of PLUS for automating the bidding process investigation. Then, we present two applications of PLUS on real-world data: the construction of audit trails for fraud detection and a price database for overpricing detection. Such applications evidence a significant reduction of specialists’ work searching for irregularities in public bids.