{"title":"Towards robotic deep spatiotemporal language understanding based on mental-image-directed semantic theory","authors":"Rojanee Khummongkol, Masao Yokota","doi":"10.1007/s10015-023-00905-8","DOIUrl":null,"url":null,"abstract":"<div><p>Among subsets of natural language, spatial language, more exactly spatiotemporal language here, has been considered most essential for human-like interaction between people and robots expected in near future. Quite distinctively from conventional learning-based approaches to natural language understanding (NLU), mental-image-directed theory (MIDST) proposes a robotic deep NLU methodology based on a mental image model as a formal system for knowledge representation and reasoning. The application system named CoMaS is designed to understand User’s utterances in text and respond in text or animation through human-like spatiotemporal reasoning based on the mental image model. In this work, CoMaS was compared with human subjects through a psychological experiment on spatiotemporal language understanding and showed globally good agreement with them and locally some interesting and reasonable disagreement. This kind of disagreement was found among the human participants as well and explainable as difference in personal conceptualization or reasoning based on mental image. The experimental results and theoretical discussion based on them showed well the effectiveness and uniqueness of our study.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 1","pages":"70 - 80"},"PeriodicalIF":0.8000,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Life and Robotics","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s10015-023-00905-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Among subsets of natural language, spatial language, more exactly spatiotemporal language here, has been considered most essential for human-like interaction between people and robots expected in near future. Quite distinctively from conventional learning-based approaches to natural language understanding (NLU), mental-image-directed theory (MIDST) proposes a robotic deep NLU methodology based on a mental image model as a formal system for knowledge representation and reasoning. The application system named CoMaS is designed to understand User’s utterances in text and respond in text or animation through human-like spatiotemporal reasoning based on the mental image model. In this work, CoMaS was compared with human subjects through a psychological experiment on spatiotemporal language understanding and showed globally good agreement with them and locally some interesting and reasonable disagreement. This kind of disagreement was found among the human participants as well and explainable as difference in personal conceptualization or reasoning based on mental image. The experimental results and theoretical discussion based on them showed well the effectiveness and uniqueness of our study.