{"title":"Can a machine talk the talk though not climb the rock? A Turing Test on rock climbing","authors":"Otto Segersven, Ilkka Arminen","doi":"10.1016/j.dcm.2025.100915","DOIUrl":null,"url":null,"abstract":"<div><div>Large Language Models demonstrate considerable fluency in human discourse. Despite their potentially transformative impact, their limits and capabilities are yet to be discovered. To mitigate potential harm and harness their potential for the benefit of society, it is important to understand their capabilities in human–machine interaction. To address this challenge, we present results from a pilot study involving rock climbers and ChatGPT-4. In our Task-Specific Turing Test, expert group members ask any question they believe will distinguish between the machine and a fellow member. The paper employs a perspective which focuses on expert discourses of social groups and their linguistic competence in expressing and demonstrating their expertise. Results show that ChatGPT is successful in passing as a rock climber in several areas of discourse but (still) falls short in one area. Experiential knowledge – in particular, embodiment – proved a revealing distinction between human and machine. We conclude by emphasizing that the language skills displayed by LLMs ultimately stems from human-AI ensembles.</div></div>","PeriodicalId":46649,"journal":{"name":"Discourse Context & Media","volume":"67 ","pages":"Article 100915"},"PeriodicalIF":3.1000,"publicationDate":"2025-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Discourse Context & Media","FirstCategoryId":"98","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211695825000649","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMMUNICATION","Score":null,"Total":0}
引用次数: 0
Abstract
Large Language Models demonstrate considerable fluency in human discourse. Despite their potentially transformative impact, their limits and capabilities are yet to be discovered. To mitigate potential harm and harness their potential for the benefit of society, it is important to understand their capabilities in human–machine interaction. To address this challenge, we present results from a pilot study involving rock climbers and ChatGPT-4. In our Task-Specific Turing Test, expert group members ask any question they believe will distinguish between the machine and a fellow member. The paper employs a perspective which focuses on expert discourses of social groups and their linguistic competence in expressing and demonstrating their expertise. Results show that ChatGPT is successful in passing as a rock climber in several areas of discourse but (still) falls short in one area. Experiential knowledge – in particular, embodiment – proved a revealing distinction between human and machine. We conclude by emphasizing that the language skills displayed by LLMs ultimately stems from human-AI ensembles.