{"title":"Panama Papers: How Data Science Fought Corruption","authors":"M. Mukhopadhyay, Kaushik Ghosh","doi":"10.2139/ssrn.3644821","DOIUrl":null,"url":null,"abstract":"The Panama Papers are eleven million leaked electronic documents that detail financial and attorney–client information for more than two hundred thousand offshore entities. The documents were leaked in April 2016 by an anonymous whistle-blower from the database of Panamanian law firm and corporate service provider Mossack Fonseca. In this case study, we discuss how a team of international network of journalists collaborated using data mining tools to unearth vital financial fraud information from this large chunk of unstructured data. The case study begins with a prologue of dialogue between the whistle-blower and protagonist. The case is divided into nine mini chapters where we start with how a data team were formed followed by the tools used to clean and annotate the unstructured data into a graph-based database and finally using expert network of journalists to generate financial insights from the various nodes and links of the graph. We conclude the case by providing word clouds that highlights Indian connections.","PeriodicalId":319022,"journal":{"name":"Economics of Networks eJournal","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Economics of Networks eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3644821","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The Panama Papers are eleven million leaked electronic documents that detail financial and attorney–client information for more than two hundred thousand offshore entities. The documents were leaked in April 2016 by an anonymous whistle-blower from the database of Panamanian law firm and corporate service provider Mossack Fonseca. In this case study, we discuss how a team of international network of journalists collaborated using data mining tools to unearth vital financial fraud information from this large chunk of unstructured data. The case study begins with a prologue of dialogue between the whistle-blower and protagonist. The case is divided into nine mini chapters where we start with how a data team were formed followed by the tools used to clean and annotate the unstructured data into a graph-based database and finally using expert network of journalists to generate financial insights from the various nodes and links of the graph. We conclude the case by providing word clouds that highlights Indian connections.