{"title":"RAIDAD: A model-driven framework for automated and agile development of IoT data analysis software","authors":"Mohsen Gholami, Bahman Zamani, Behrouz Shahgholi Ghahfarokhi","doi":"10.1016/j.infsof.2025.107818","DOIUrl":null,"url":null,"abstract":"<div><h3>Context:</h3><div>Nowadays, developing data analysis software for the IoT domain faces challenges such as complexity, repetitive tasks, and developers’ lack of domain knowledge. To address these issues, methodologies like CRISP-DM have been introduced, providing structured guidance for data analysis.</div></div><div><h3>Objectives:</h3><div>Despite the availability of structured methodologies, building data analysis pipelines still involves managing complexity and redundancy. Model-driven approaches have been proposed to tackle these challenges but often fail to address all stages of the data analysis workflow and the interdependencies between stages and datasets comprehensively. This research introduces RAIDAD, a model-driven framework that addresses these gaps by covering all phases of the CRISP-DM methodology.</div></div><div><h3>Methods:</h3><div>RAIDAD includes a domain-specific modeling language for IoT data analysis, a graphical modeling editor, a code generation transformation engine, and a data model assistant for seamless model-data integration. These components are delivered as an Eclipse plugin.</div></div><div><h3>Results:</h3><div>The evaluation of RAIDAD is two-fold. First, a comparative operational evaluation with RapidMiner and ML-Quadrat shows RAIDAD achieves a 9.6% improvement in usability and productivity over RapidMiner and a 23% improvement over ML-Quadrat. Second, RAIDAD is compared to a general-purpose programming language, demonstrating its superiority in reducing effort and production time for IoT data analysis software.</div></div><div><h3>Conclusion:</h3><div>This comprehensive framework ensures an efficient and organized approach to data analysis, addressing key challenges in the IoT domain. Future research will focus on expanding RAIDAD’s support for a wider range of data analysis and machine learning algorithms, enhancing automation capabilities, and incorporating continuous user feedback to ensure the framework evolves in line with emerging needs.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"187 ","pages":"Article 107818"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information and Software Technology","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950584925001570","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Context:
Nowadays, developing data analysis software for the IoT domain faces challenges such as complexity, repetitive tasks, and developers’ lack of domain knowledge. To address these issues, methodologies like CRISP-DM have been introduced, providing structured guidance for data analysis.
Objectives:
Despite the availability of structured methodologies, building data analysis pipelines still involves managing complexity and redundancy. Model-driven approaches have been proposed to tackle these challenges but often fail to address all stages of the data analysis workflow and the interdependencies between stages and datasets comprehensively. This research introduces RAIDAD, a model-driven framework that addresses these gaps by covering all phases of the CRISP-DM methodology.
Methods:
RAIDAD includes a domain-specific modeling language for IoT data analysis, a graphical modeling editor, a code generation transformation engine, and a data model assistant for seamless model-data integration. These components are delivered as an Eclipse plugin.
Results:
The evaluation of RAIDAD is two-fold. First, a comparative operational evaluation with RapidMiner and ML-Quadrat shows RAIDAD achieves a 9.6% improvement in usability and productivity over RapidMiner and a 23% improvement over ML-Quadrat. Second, RAIDAD is compared to a general-purpose programming language, demonstrating its superiority in reducing effort and production time for IoT data analysis software.
Conclusion:
This comprehensive framework ensures an efficient and organized approach to data analysis, addressing key challenges in the IoT domain. Future research will focus on expanding RAIDAD’s support for a wider range of data analysis and machine learning algorithms, enhancing automation capabilities, and incorporating continuous user feedback to ensure the framework evolves in line with emerging needs.
期刊介绍:
Information and Software Technology is the international archival journal focusing on research and experience that contributes to the improvement of software development practices. The journal''s scope includes methods and techniques to better engineer software and manage its development. Articles submitted for review should have a clear component of software engineering or address ways to improve the engineering and management of software development. Areas covered by the journal include:
• Software management, quality and metrics,
• Software processes,
• Software architecture, modelling, specification, design and programming
• Functional and non-functional software requirements
• Software testing and verification & validation
• Empirical studies of all aspects of engineering and managing software development
Short Communications is a new section dedicated to short papers addressing new ideas, controversial opinions, "Negative" results and much more. Read the Guide for authors for more information.
The journal encourages and welcomes submissions of systematic literature studies (reviews and maps) within the scope of the journal. Information and Software Technology is the premiere outlet for systematic literature studies in software engineering.