{"title":"BEKG:建筑环境知识图谱","authors":"Xiaojun Yang, Haoyu Zhong, Zhengdong Wang, Penglin Du, Keyi Zhou, Heping Zhou, Xingjin Lai, Yik Lun Lau, Yangqiu Song, Liyaning Tang","doi":"10.1080/09613218.2023.2238851","DOIUrl":null,"url":null,"abstract":"In recent years, the digitalization of the built environment has progressed rapidly due to the development of modern design and construction technologies. However, the need for extensive professional knowledge in this field has not been met by practitioners and scholars. To address this problem, a study was conducted to build a knowledge graph in the built environment domain, which stores entities and their connections in a graph data model. To achieve it, this research collected more than 80,000 paper abstracts from the built environment domain. To ensure the accuracy of entities and relationships in the knowledge graph, two well-annotated datasets were created with 29 types of relationships, each containing 2000 and 1450 instances, respectively, for Named Entity Recognition (NER) and relationship extraction (RE) tasks. Two BERT-based models were trained on these datasets and achieved over 85% accuracy in both tasks. Using these models, over 200,000 high-quality relationships and entities were extracted from abstract data. This comprehensive knowledge graph will help practitioners and scholars better understand the built environment domain.","PeriodicalId":55316,"journal":{"name":"Building Research and Information","volume":"37 1","pages":"0"},"PeriodicalIF":3.7000,"publicationDate":"2023-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"BEKG: A built environment knowledge graph\",\"authors\":\"Xiaojun Yang, Haoyu Zhong, Zhengdong Wang, Penglin Du, Keyi Zhou, Heping Zhou, Xingjin Lai, Yik Lun Lau, Yangqiu Song, Liyaning Tang\",\"doi\":\"10.1080/09613218.2023.2238851\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, the digitalization of the built environment has progressed rapidly due to the development of modern design and construction technologies. However, the need for extensive professional knowledge in this field has not been met by practitioners and scholars. To address this problem, a study was conducted to build a knowledge graph in the built environment domain, which stores entities and their connections in a graph data model. To achieve it, this research collected more than 80,000 paper abstracts from the built environment domain. To ensure the accuracy of entities and relationships in the knowledge graph, two well-annotated datasets were created with 29 types of relationships, each containing 2000 and 1450 instances, respectively, for Named Entity Recognition (NER) and relationship extraction (RE) tasks. Two BERT-based models were trained on these datasets and achieved over 85% accuracy in both tasks. Using these models, over 200,000 high-quality relationships and entities were extracted from abstract data. This comprehensive knowledge graph will help practitioners and scholars better understand the built environment domain.\",\"PeriodicalId\":55316,\"journal\":{\"name\":\"Building Research and Information\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2023-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Building Research and Information\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/09613218.2023.2238851\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building Research and Information","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/09613218.2023.2238851","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
In recent years, the digitalization of the built environment has progressed rapidly due to the development of modern design and construction technologies. However, the need for extensive professional knowledge in this field has not been met by practitioners and scholars. To address this problem, a study was conducted to build a knowledge graph in the built environment domain, which stores entities and their connections in a graph data model. To achieve it, this research collected more than 80,000 paper abstracts from the built environment domain. To ensure the accuracy of entities and relationships in the knowledge graph, two well-annotated datasets were created with 29 types of relationships, each containing 2000 and 1450 instances, respectively, for Named Entity Recognition (NER) and relationship extraction (RE) tasks. Two BERT-based models were trained on these datasets and achieved over 85% accuracy in both tasks. Using these models, over 200,000 high-quality relationships and entities were extracted from abstract data. This comprehensive knowledge graph will help practitioners and scholars better understand the built environment domain.
期刊介绍:
BUILDING RESEARCH & INFORMATION (BRI) is a leading international refereed journal focussed on buildings and their supporting systems. Unique to BRI is a focus on a holistic, transdisciplinary approach to buildings and the complexity of issues involving the built environment with other systems over the course of their life: planning, briefing, design, construction, occupation and use, property exchange and evaluation, maintenance, alteration and end of life. Published articles provide conceptual and evidence-based approaches which reflect the complexity and linkages between cultural, environmental, economic, social, organisational, quality of life, health, well-being, design and engineering of the built environment.