{"title":"Knowledge-enhanced ontology-to-vector for automated ontology concept enrichment in BIM","authors":"Yinyi Wei, Xiao Li","doi":"10.1016/j.jii.2025.100836","DOIUrl":null,"url":null,"abstract":"<div><div>Building Information Modeling (BIM) relies on standardized ontologies like IfcOWL to address interoperability. However, the increasing complexity and diversity of construction information requirements demand automated enrichment of BIM ontologies, which is hindered by several factors, including complexity in ontology structure, scalability limitations, and domain-specific issues. Manual curation and maintenance of ontologies are labor-intensive and time-consuming, particularly as the scope of BIM projects expands. Despite these challenges, the construction industry lacks an effective automated approach for ontology concept enrichment. Thus, this study proposes a knowledge-enhanced ontology-to-vector (Keno2Vec) approach for automated BIM ontology concept enrichment, which can (1) encode ontology elements into meaningful and semantically rich embeddings by employing the BERT model to integrate both ontological information (names and labels) and external knowledge (definitions from authoritative knowledge bases), effectively addressing the domain expression specificity and complexity of BIM ontologies; and (2) provide a flexible framework that supports various downstream tasks of ontology concept enrichment by utilizing the resulting embeddings, thereby improving the task-specific adaptability and variability. Experimental results on datasets derived from the large-scale ifcOWL and two smaller BIM ontologies demonstrate that Keno2Vec significantly outperforms existing ontology embedding approaches in terms of accuracy and adaptability. For example, Keno2Vec achieves F1 scores on ifcOWL of nearly 87 % for subsumption prediction, 60 % for property identification, 95 % for membership recognition, and 100 % and 90 % for category-based and schema-based concept classification, respectively. Additional analysis highlights the potential of Keno2Vec for improving BIM ontology encoding and benefiting downstream applications.</div></div>","PeriodicalId":55975,"journal":{"name":"Journal of Industrial Information Integration","volume":"45 ","pages":"Article 100836"},"PeriodicalIF":10.4000,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Industrial Information Integration","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2452414X25000603","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Building Information Modeling (BIM) relies on standardized ontologies like IfcOWL to address interoperability. However, the increasing complexity and diversity of construction information requirements demand automated enrichment of BIM ontologies, which is hindered by several factors, including complexity in ontology structure, scalability limitations, and domain-specific issues. Manual curation and maintenance of ontologies are labor-intensive and time-consuming, particularly as the scope of BIM projects expands. Despite these challenges, the construction industry lacks an effective automated approach for ontology concept enrichment. Thus, this study proposes a knowledge-enhanced ontology-to-vector (Keno2Vec) approach for automated BIM ontology concept enrichment, which can (1) encode ontology elements into meaningful and semantically rich embeddings by employing the BERT model to integrate both ontological information (names and labels) and external knowledge (definitions from authoritative knowledge bases), effectively addressing the domain expression specificity and complexity of BIM ontologies; and (2) provide a flexible framework that supports various downstream tasks of ontology concept enrichment by utilizing the resulting embeddings, thereby improving the task-specific adaptability and variability. Experimental results on datasets derived from the large-scale ifcOWL and two smaller BIM ontologies demonstrate that Keno2Vec significantly outperforms existing ontology embedding approaches in terms of accuracy and adaptability. For example, Keno2Vec achieves F1 scores on ifcOWL of nearly 87 % for subsumption prediction, 60 % for property identification, 95 % for membership recognition, and 100 % and 90 % for category-based and schema-based concept classification, respectively. Additional analysis highlights the potential of Keno2Vec for improving BIM ontology encoding and benefiting downstream applications.
期刊介绍:
The Journal of Industrial Information Integration focuses on the industry's transition towards industrial integration and informatization, covering not only hardware and software but also information integration. It serves as a platform for promoting advances in industrial information integration, addressing challenges, issues, and solutions in an interdisciplinary forum for researchers, practitioners, and policy makers.
The Journal of Industrial Information Integration welcomes papers on foundational, technical, and practical aspects of industrial information integration, emphasizing the complex and cross-disciplinary topics that arise in industrial integration. Techniques from mathematical science, computer science, computer engineering, electrical and electronic engineering, manufacturing engineering, and engineering management are crucial in this context.