Text Chunking in Human-AI Teaming of Document Classification for Urban Systems Management Using Large Language Models

Joshua Rodriguez, Om Sanan, Guillermo Vizarreta-Luna, Steven A. Conrad

IET Smart Cities, vol. 8, no. 1. DOI: 10.1049/smc2.70023. Publication date: 2026-02-28.
Open-access PDF: https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/smc2.70023
Citations: 0
Abstract
Urban systems management requires analysing complex textual documentation, which must be coded and analysed to set requirements and evaluate built environment performance. This study contributes to research on applying large language models (LLMs) to qualitative coding, aiming to reduce resource demands while achieving reliability comparable to human coders. Qualitative coding faces challenges such as resource limitations, bias, and inconsistency between evaluators. We report the application of LLMs to deductively code 10 case documents for 17 common characteristics of urban systems management. We use whole-text analysis and text-chunk analysis to compare LLM coding with human coding efforts across three OpenAI models. Results indicate that LLMs may perform similarly to human coders when initialised with specific deductive coding contexts. GPT-4o, o1-mini and GPT-4o-mini showed significant agreement with human raters when using a chunking method. Applying GPT-4o and GPT-4o-mini as additional raters alongside humans also yielded statistically significant agreement, indicating that the analysis of textual documents benefits from the addition of LLMs. The novel contribution of this paper is the domain-specific evaluation of a chunk-based prompting approach for deductive coding in urban systems management, validating human-AI teaming performance.
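The chunk-based approach described above can be sketched in two steps: split each document into chunks small enough to prompt an LLM with a specific deductive coding context, then aggregate the per-chunk codes into document-level codes. The paper does not publish its chunk sizes or aggregation rule, so the word-based windowing, overlap, and "present in any chunk" aggregation below are illustrative assumptions, not the authors' implementation:

```python
def chunk_text(text: str, max_words: int = 200, overlap: int = 20) -> list[str]:
    """Split a document into overlapping word-based chunks.

    max_words and overlap are placeholder values; the paper does not
    report its actual chunking parameters.
    """
    words = text.split()
    if not words:
        return []
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break
    return chunks


def aggregate_codes(chunk_codes: list[dict[str, bool]]) -> dict[str, bool]:
    """Combine per-chunk binary codes into document-level codes.

    Assumed rule: a characteristic is coded present for the document
    if any chunk was coded positive for it.
    """
    characteristics = chunk_codes[0].keys()
    return {c: any(codes[c] for codes in chunk_codes) for c in characteristics}
```

Each chunk would be sent to the model with the same deductive codebook prompt (the 17 characteristics), so every span of the document is evaluated against the full coding frame rather than relying on a single whole-text pass.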