Ricardo Accorsi Casonatto , Tales De Pádua Grillo Souza , Ari Melo Mariano
{"title":"数据挖掘中的质量与风险管理:CRISP-DM 视角。","authors":"Ricardo Accorsi Casonatto , Tales De Pádua Grillo Souza , Ari Melo Mariano","doi":"10.1016/j.procs.2024.08.257","DOIUrl":null,"url":null,"abstract":"<div><p>The area of data science knowledge responsible for dealing with this new reality is diffuse, including mathematics, statistics, computing, engineering, psychology, and administration, among many other areas that make up a new scenario that is still changing. Different models have emerged over the years to systematize the procedures to be followed. Among them, CRISP-DM (Cross Industry Standard Process for Data Mining) has become one of the most widespread in the industry. However, the lack of detailed instructions means the framework is often incorrectly used. Therefore, this research aims to present a utilitarian and didactic model based on the latest advances in the literature and through the lens of production engineering. In order to achieve this objective, exploratory research was carried out based on a systematic review and subsequent categorization of each of the CRISP-DM steps, detailing the authors’ contributions to each stage. In addition, it is proposed that guidelines from the areas of Quality Management and Risk Management be added to the subject, consolidating a useful and didactic model of relevance.</p></div>","PeriodicalId":20465,"journal":{"name":"Procedia Computer Science","volume":"242 ","pages":"Pages 161-168"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1877050924019768/pdf?md5=db8eb2579aadeaa41b23b028ffc4301a&pid=1-s2.0-S1877050924019768-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Quality and Risk Management in Data Mining: A CRISP-DM Perspective.\",\"authors\":\"Ricardo Accorsi Casonatto , Tales De Pádua Grillo Souza , Ari Melo Mariano\",\"doi\":\"10.1016/j.procs.2024.08.257\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The area of data science knowledge responsible for dealing with this new reality is diffuse, including mathematics, statistics, computing, engineering, psychology, and administration, among many other areas that make up a new scenario that is still changing. Different models have emerged over the years to systematize the procedures to be followed. Among them, CRISP-DM (Cross Industry Standard Process for Data Mining) has become one of the most widespread in the industry. However, the lack of detailed instructions means the framework is often incorrectly used. Therefore, this research aims to present a utilitarian and didactic model based on the latest advances in the literature and through the lens of production engineering. In order to achieve this objective, exploratory research was carried out based on a systematic review and subsequent categorization of each of the CRISP-DM steps, detailing the authors’ contributions to each stage. In addition, it is proposed that guidelines from the areas of Quality Management and Risk Management be added to the subject, consolidating a useful and didactic model of relevance.</p></div>\",\"PeriodicalId\":20465,\"journal\":{\"name\":\"Procedia Computer Science\",\"volume\":\"242 \",\"pages\":\"Pages 161-168\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S1877050924019768/pdf?md5=db8eb2579aadeaa41b23b028ffc4301a&pid=1-s2.0-S1877050924019768-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Procedia Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1877050924019768\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Procedia Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877050924019768","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Quality and Risk Management in Data Mining: A CRISP-DM Perspective.
The area of data science knowledge responsible for dealing with this new reality is diffuse, including mathematics, statistics, computing, engineering, psychology, and administration, among many other areas that make up a new scenario that is still changing. Different models have emerged over the years to systematize the procedures to be followed. Among them, CRISP-DM (Cross Industry Standard Process for Data Mining) has become one of the most widespread in the industry. However, the lack of detailed instructions means the framework is often incorrectly used. Therefore, this research aims to present a utilitarian and didactic model based on the latest advances in the literature and through the lens of production engineering. In order to achieve this objective, exploratory research was carried out based on a systematic review and subsequent categorization of each of the CRISP-DM steps, detailing the authors’ contributions to each stage. In addition, it is proposed that guidelines from the areas of Quality Management and Risk Management be added to the subject, consolidating a useful and didactic model of relevance.