Of rats and robots: A mutual learning paradigm

IF 1.4; Zone 3 (Psychology); JCR Q4 (Behavioral Sciences)
Oguzcan Nas, Defne Albayrak, Gunes Unal
Journal of the Experimental Analysis of Behavior, vol. 123, no. 2, pp. 176-201. Journal Article. Published 2025-03-12.
DOI: 10.1002/jeab.70004
URL: https://onlinelibrary.wiley.com/doi/10.1002/jeab.70004
Citations: 0

Abstract

Robots are increasingly used alongside Skinner boxes to train animals in operant conditioning tasks. Similarly, animals are being employed in artificial intelligence research to train various algorithms. However, both types of experiments rely on unidirectional learning, where one partner—the animal or the robot—acts as the teacher and the other as the student. Here, we present a novel animal–robot interaction paradigm that enables bidirectional, or mutual, learning between a Wistar rat and a robot. The two agents interacted with each other to achieve specific goals, dynamically adjusting their actions based on the positive (rewarding) or negative (punishing) signals provided by their partner. The paradigm was tested in silico with two artificial reinforcement learning agents and in vivo with different rat–robot pairs. In the virtual trials, both agents were able to adapt their behavior toward reward maximization, achieving mutual learning. The in vivo experiments revealed that rats rapidly acquired the behaviors necessary to receive the reward and exhibited passive avoidance learning for negative signals when the robot displayed a steep learning curve. The developed paradigm can be used in various animal–machine interactions to test the efficacy of different learning rules and reinforcement schedules.
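The in silico setup described in the abstract, two reinforcement learning agents that each adjust their behavior according to positive or negative signals from the partner, can be illustrated with a toy sketch. This is not the authors' implementation: the stateless epsilon-greedy learner, the three-action repertoire, the `wanted_from_partner` targets, and the values of `alpha` and `epsilon` are all assumptions chosen for illustration.

```python
import random


class Agent:
    """Stateless epsilon-greedy learner (hypothetical; not the authors' model)."""

    def __init__(self, n_actions, wanted_from_partner, rng, alpha=0.1, epsilon=0.1):
        self.values = [0.0] * n_actions             # value estimate per own action
        self.wanted_from_partner = wanted_from_partner  # behavior this agent rewards
        self.rng = rng
        self.alpha = alpha                          # learning rate (assumed)
        self.epsilon = epsilon                      # exploration rate (assumed)

    def greedy_action(self):
        # Index of the highest-valued action; ties resolve to the lowest index.
        return max(range(len(self.values)), key=self.values.__getitem__)

    def act(self):
        # Explore with probability epsilon, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.values))
        return self.greedy_action()

    def signal_for(self, partner_action):
        # Positive (rewarding) or negative (punishing) signal sent to the partner.
        return 1.0 if partner_action == self.wanted_from_partner else -1.0

    def learn(self, action, signal):
        # Move the estimate for the chosen action toward the received signal.
        self.values[action] += self.alpha * (signal - self.values[action])


def run_trials(n_trials=500, seed=0):
    rng = random.Random(seed)
    a = Agent(3, wanted_from_partner=2, rng=rng)
    b = Agent(3, wanted_from_partner=1, rng=rng)
    for _ in range(n_trials):
        act_a, act_b = a.act(), b.act()
        # Each agent's only feedback is the signal issued by its partner.
        a.learn(act_a, b.signal_for(act_a))
        b.learn(act_b, a.signal_for(act_b))
    return a, b


agent_a, agent_b = run_trials()
```

In this sketch both agents end up preferring the action their partner rewards, which is the simplest sense in which learning is bidirectional. A richer model would let the signalling behavior itself be learned, which this toy version deliberately leaves out.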


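Source journal: Journal of the Experimental Analysis of Behavior
CiteScore: 3.90
Self-citation rate: 14.80%
Articles published: 83
Review time: >12 weeks
Journal description: Journal of the Experimental Analysis of Behavior is primarily for the original publication of experiments relevant to the behavior of individual organisms.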