Performance benchmarking of multimodal data-driven approaches in industrial settings
Authors: Diyar Altinses, Andreas Schwung
Journal: Machine learning with applications, volume 21, Article 100691 (impact factor 4.9)
Publication type: Journal Article
Published: 2025-06-25
DOI: 10.1016/j.mlwa.2025.100691
URL: https://www.sciencedirect.com/science/article/pii/S266682702500074X
Citations: 0
Abstract
Data-driven solutions are increasingly transforming the industrial sector, yet collecting large-scale, multimodal datasets remains costly and challenging. This paper presents three synthetic multimodal datasets that replicate real-world industrial conditions across varying levels of complexity, designed to benchmark multimodal machine learning models. We validate their utility through a series of experiments. Cross-modal prediction and domain adaptation demonstrate that the datasets effectively capture strong multimodal correlations. Multimodal reconstruction experiments confirm the internal consistency and richness of the fused representations, indicating that the modalities complement each other in capturing underlying structure. Additionally, multimodal regression significantly outperforms unimodal baselines, underscoring the predictive strength gained through multimodal integration. Together, these results demonstrate the utility of our datasets, establishing a solid baseline for future research and encouraging further advancements in industrial data-driven solutions.