Moisés Silva-Muñoz , Jonnatan Oyarzún , Gustavo Semaan , Carlos Contreras-Bolton , Carlos Rey , Victor Parada
{"title":"通过自动生成的算法自动聚类","authors":"Moisés Silva-Muñoz , Jonnatan Oyarzún , Gustavo Semaan , Carlos Contreras-Bolton , Carlos Rey , Victor Parada","doi":"10.1016/j.engappai.2025.111596","DOIUrl":null,"url":null,"abstract":"<div><div>Clustering data based on similarity becomes particularly challenging when the number of clusters is not known in advance. This case, known as the automatic clustering problem (ACP), corresponds to an optimization problem that aims to identify the best possible clustering among the many existing options. Although several effective ACP methods have been proposed, identifying optimal clusterings remains a difficult task, and the space of algorithmic solutions has yet to be thoroughly explored. Existing approaches suggest that better results can be achieved by appropriately combining and assembling different techniques. While some combinations have been explored, many others remain unexamined and could be evaluated through a more exhaustive exploration, such as the automatic generation of algorithms (AGA). This article considers the combinations arising from the automatic construction of algorithms for the ACP. To this end, an optimization meta-problem is defined to construct algorithms with the best computational performance. The search for the optimal solution to the meta-problem allows a computational exploration of the space defined by all possible combinations of elementary algorithmic components. We specifically explore the potential of AGA to generate ACP-specialized algorithms tailored to each dataset. Through extensive computational experiments, we evaluate the effectiveness of these specialized algorithms with general-purpose algorithms generated by AGA and six state-of-the-art ACP algorithms across well-established datasets. The results demonstrate that both AGA-generated algorithms outperform the state-of-the-art ACP algorithms, with statistically significant differences. Furthermore, the specialized algorithms exhibit superior effectiveness, highlighting their advantage over their general-purpose counterparts.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111596"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic clustering by automatically generated algorithms\",\"authors\":\"Moisés Silva-Muñoz , Jonnatan Oyarzún , Gustavo Semaan , Carlos Contreras-Bolton , Carlos Rey , Victor Parada\",\"doi\":\"10.1016/j.engappai.2025.111596\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Clustering data based on similarity becomes particularly challenging when the number of clusters is not known in advance. This case, known as the automatic clustering problem (ACP), corresponds to an optimization problem that aims to identify the best possible clustering among the many existing options. Although several effective ACP methods have been proposed, identifying optimal clusterings remains a difficult task, and the space of algorithmic solutions has yet to be thoroughly explored. Existing approaches suggest that better results can be achieved by appropriately combining and assembling different techniques. While some combinations have been explored, many others remain unexamined and could be evaluated through a more exhaustive exploration, such as the automatic generation of algorithms (AGA). This article considers the combinations arising from the automatic construction of algorithms for the ACP. To this end, an optimization meta-problem is defined to construct algorithms with the best computational performance. The search for the optimal solution to the meta-problem allows a computational exploration of the space defined by all possible combinations of elementary algorithmic components. We specifically explore the potential of AGA to generate ACP-specialized algorithms tailored to each dataset. Through extensive computational experiments, we evaluate the effectiveness of these specialized algorithms with general-purpose algorithms generated by AGA and six state-of-the-art ACP algorithms across well-established datasets. The results demonstrate that both AGA-generated algorithms outperform the state-of-the-art ACP algorithms, with statistically significant differences. Furthermore, the specialized algorithms exhibit superior effectiveness, highlighting their advantage over their general-purpose counterparts.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"159 \",\"pages\":\"Article 111596\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625015982\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625015982","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Automatic clustering by automatically generated algorithms
Clustering data based on similarity becomes particularly challenging when the number of clusters is not known in advance. This case, known as the automatic clustering problem (ACP), corresponds to an optimization problem that aims to identify the best possible clustering among the many existing options. Although several effective ACP methods have been proposed, identifying optimal clusterings remains a difficult task, and the space of algorithmic solutions has yet to be thoroughly explored. Existing approaches suggest that better results can be achieved by appropriately combining and assembling different techniques. While some combinations have been explored, many others remain unexamined and could be evaluated through a more exhaustive exploration, such as the automatic generation of algorithms (AGA). This article considers the combinations arising from the automatic construction of algorithms for the ACP. To this end, an optimization meta-problem is defined to construct algorithms with the best computational performance. The search for the optimal solution to the meta-problem allows a computational exploration of the space defined by all possible combinations of elementary algorithmic components. We specifically explore the potential of AGA to generate ACP-specialized algorithms tailored to each dataset. Through extensive computational experiments, we evaluate the effectiveness of these specialized algorithms with general-purpose algorithms generated by AGA and six state-of-the-art ACP algorithms across well-established datasets. The results demonstrate that both AGA-generated algorithms outperform the state-of-the-art ACP algorithms, with statistically significant differences. Furthermore, the specialized algorithms exhibit superior effectiveness, highlighting their advantage over their general-purpose counterparts.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.