Leveraging synthetic trace generation of modeling operations for intelligent modeling assistants using large language models

IF 4.3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information and Software Technology Pub Date : 2025-06-16 DOI:10.1016/j.infsof.2025.107806

Vittoriano Muttillo , Claudio Di Sipio , Riccardo Rubei , Luca Berardinelli

{"title":"Leveraging synthetic trace generation of modeling operations for intelligent modeling assistants using large language models","authors":"Vittoriano Muttillo , Claudio Di Sipio , Riccardo Rubei , Luca Berardinelli","doi":"10.1016/j.infsof.2025.107806","DOIUrl":null,"url":null,"abstract":"<div><h3>Context:</h3><div>Due to the proliferation of generative AI models in different software engineering tasks, the research community has started to exploit those models, spanning from requirement specification to code development. Model-Driven Engineering (MDE) is a paradigm that leverages software models as primary artifacts to automate tasks. In this respect, modelers have started to investigate the interplay between traditional MDE practices and Large Language Models (LLMs) to push automation. Although powerful, LLMs exhibit limitations that undermine the quality of generated modeling artifacts, e.g., hallucination or incorrect formatting. Recording modeling operations relies on human-based activities to train modeling assistants, helping modelers in their daily tasks. Nevertheless, those techniques require a huge amount of training data that cannot be available due to several factors, e.g., security or privacy issues.</div></div><div><h3>Objective:</h3><div>In this paper, we propose an extension of a conceptual MDE framework, called MASTER-LLM, that combines different MDE tools and paradigms to support industrial and academic practitioners.</div></div><div><h3>Method:</h3><div>MASTER-LLM comprises a modeling environment that acts as the active context in which a dedicated component records modeling operations. Then, model completion is enabled by the modeling assistant trained on past operations. Different LLMs are used to generate a new dataset of modeling events to speed up recording and data collection.</div></div><div><h3>Results:</h3><div>To evaluate the feasibility of MASTER-LLM in practice, we experiment with two modeling environments, i.e., CAEX and HEPSYCODE, employed in industrial use cases within European projects. We investigate how the examined LLMs can generate realistic modeling operations in different domains.</div></div><div><h3>Conclusion:</h3><div>We show that synthetic traces can be effectively used when the application domain is less complex, while complex scenarios require human-based operations or a mixed approach according to data availability. However, generative AI models must be assessed using proper methodologies to avoid security issues in industrial domains.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"186 ","pages":"Article 107806"},"PeriodicalIF":4.3000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information and Software Technology","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950584925001454","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Context:

Due to the proliferation of generative AI models in different software engineering tasks, the research community has started to exploit those models, spanning from requirement specification to code development. Model-Driven Engineering (MDE) is a paradigm that leverages software models as primary artifacts to automate tasks. In this respect, modelers have started to investigate the interplay between traditional MDE practices and Large Language Models (LLMs) to push automation. Although powerful, LLMs exhibit limitations that undermine the quality of generated modeling artifacts, e.g., hallucination or incorrect formatting. Recording modeling operations relies on human-based activities to train modeling assistants, helping modelers in their daily tasks. Nevertheless, those techniques require a huge amount of training data that cannot be available due to several factors, e.g., security or privacy issues.

Objective:

In this paper, we propose an extension of a conceptual MDE framework, called MASTER-LLM, that combines different MDE tools and paradigms to support industrial and academic practitioners.

Method:

MASTER-LLM comprises a modeling environment that acts as the active context in which a dedicated component records modeling operations. Then, model completion is enabled by the modeling assistant trained on past operations. Different LLMs are used to generate a new dataset of modeling events to speed up recording and data collection.

Results:

To evaluate the feasibility of MASTER-LLM in practice, we experiment with two modeling environments, i.e., CAEX and HEPSYCODE, employed in industrial use cases within European projects. We investigate how the examined LLMs can generate realistic modeling operations in different domains.

Conclusion:

We show that synthetic traces can be effectively used when the application domain is less complex, while complex scenarios require human-based operations or a mixed approach according to data availability. However, generative AI models must be assessed using proper methodologies to avoid security issues in industrial domains.

查看原文本刊更多论文

为使用大型语言模型的智能建模助手利用建模操作的合成跟踪生成

背景：由于在不同的软件工程任务中生成人工智能模型的激增，研究团体已经开始利用这些模型，从需求规范到代码开发。模型驱动工程（MDE）是一种范例，它利用软件模型作为主要工件来自动化任务。在这方面，建模者已经开始研究传统MDE实践和大型语言模型（llm）之间的相互作用，以推动自动化。尽管llm功能强大，但它也有一些限制，会破坏生成的建模工件的质量，例如，幻觉或不正确的格式。记录建模操作依赖于基于人类的活动来训练建模助手，帮助建模者完成日常任务。然而，这些技术需要大量的训练数据，而这些数据由于安全或隐私问题等几个因素而无法获得。目的：在本文中，我们提出了一个概念性MDE框架的扩展，称为MASTER-LLM，它结合了不同的MDE工具和范式，以支持工业和学术从业者。方法：MASTER-LLM包含一个建模环境，该环境充当活动上下文，专用组件在其中记录建模操作。然后，模型完成由经过过去操作训练的建模助手来实现。使用不同的llm来生成新的建模事件数据集，以加快记录和数据收集。结果：为了评估MASTER-LLM在实践中的可行性，我们实验了两个建模环境，即CAEX和HEPSYCODE，在欧洲项目的工业用例中使用。我们研究如何检查法学硕士可以在不同的领域产生现实的建模操作。结论：我们表明，当应用领域不太复杂时，可以有效地使用合成痕迹，而复杂的场景需要基于人工操作或根据数据可用性混合方法。然而，生成人工智能模型必须使用适当的方法进行评估，以避免工业领域的安全问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information and Software Technology 工程技术-计算机：软件工程

CiteScore

9.10

自引率

7.70%

发文量

164

审稿时长

9.6 weeks

期刊介绍： Information and Software Technology is the international archival journal focusing on research and experience that contributes to the improvement of software development practices. The journal''s scope includes methods and techniques to better engineer software and manage its development. Articles submitted for review should have a clear component of software engineering or address ways to improve the engineering and management of software development. Areas covered by the journal include: • Software management, quality and metrics, • Software processes, • Software architecture, modelling, specification, design and programming • Functional and non-functional software requirements • Software testing and verification & validation • Empirical studies of all aspects of engineering and managing software development Short Communications is a new section dedicated to short papers addressing new ideas, controversial opinions, "Negative" results and much more. Read the Guide for authors for more information. The journal encourages and welcomes submissions of systematic literature studies (reviews and maps) within the scope of the journal. Information and Software Technology is the premiere outlet for systematic literature studies in software engineering.