{"title":"Auto-summarization of the texts of construction dispute precedents","authors":"Wonkyoung Seo , Youngcheol Kang","doi":"10.1016/j.aei.2025.103381","DOIUrl":null,"url":null,"abstract":"<div><div>Advancements in text analysis are driving the adoption of document automation in the construction industry. Despite significant financial losses from construction disputes, efforts to automate document processes in this domain remain limited. Effective dispute management requires the rapid identification of relevant precedent cases to help practitioners respond appropriately. However, the complexity and length of such texts pose challenges to quick comprehension. This study presents a natural language processing (NLP) model for automatically summarizing construction dispute case texts. The model was tested on 300 U.S. construction dispute cases sourced from the Westlaw database. Various NLP models, including large language models (LLMs) such as OpenAI’s models and BERT, were evaluated, achieving an F-score of approximately 0.39 based on the ROUGE-L metric. To accomplish the domain-specific objective of summarizing construction precedent cases, this study explored multiple approaches, including data preprocessing, fine-tuning, and model engineering using LangChain. Furthermore, this study aims to develop models for summarizing legal precedent texts and investigates methods to capture the distinctive characteristics of construction dispute data compared to general legal texts. The models were validated through domain experts who recognize the unique nature of construction disputes, enhancing the reliability of the evaluation process. The findings contribute significantly to the automation of construction dispute document summarization, enabling practitioners to manage such cases more efficiently.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103381"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625002745","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Advancements in text analysis are driving the adoption of document automation in the construction industry. Despite significant financial losses from construction disputes, efforts to automate document processes in this domain remain limited. Effective dispute management requires the rapid identification of relevant precedent cases to help practitioners respond appropriately. However, the complexity and length of such texts pose challenges to quick comprehension. This study presents a natural language processing (NLP) model for automatically summarizing construction dispute case texts. The model was tested on 300 U.S. construction dispute cases sourced from the Westlaw database. Various NLP models, including large language models (LLMs) such as OpenAI’s models and BERT, were evaluated, achieving an F-score of approximately 0.39 based on the ROUGE-L metric. To accomplish the domain-specific objective of summarizing construction precedent cases, this study explored multiple approaches, including data preprocessing, fine-tuning, and model engineering using LangChain. Furthermore, this study aims to develop models for summarizing legal precedent texts and investigates methods to capture the distinctive characteristics of construction dispute data compared to general legal texts. The models were validated through domain experts who recognize the unique nature of construction disputes, enhancing the reliability of the evaluation process. The findings contribute significantly to the automation of construction dispute document summarization, enabling practitioners to manage such cases more efficiently.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.