{"title":"Learning regions for building a world model from clusters in probability distributions","authors":"W. Slowinski, Frank Guerin","doi":"10.1109/DEVLRN.2011.6037339","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037339","url":null,"abstract":"A developing agent learns a model of the world by observing regularities occurring in its sensory inputs. In a continuous domain where the model is represented by a set of rules, a significant part of the task of learning such a model is to find appropriate intervals within the continuous state variables, such that these intervals can be used to define rules whose predictions are reliable. We propose a technique to find such intervals (or regions) by means of finding clusters on approximate probability distributions of sensory variables. We compare this cluster-based method with an alternative landmark-based algorithm. We evaluate both techniques on a data log recorded in a simulation based on OpenArena, a three-dimensional first-person-perspective computer game, and demonstrate the results of how the techniques can learn rules which describe walking behaviour. While both techniques work reasonably well, the clustering approach seems to give more “natural” regions which correspond more closely to what a human would expect; we speculate that such regions should be more useful if they are to form a basis for further learning of higher order rules.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121060106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Olivier L. Georgeon, James B. Marshall, Pierre-Yves Ronot
{"title":"Early-stage vision of composite scenes for spatial learning and navigation","authors":"Olivier L. Georgeon, James B. Marshall, Pierre-Yves Ronot","doi":"10.1109/DEVLRN.2011.6037348","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037348","url":null,"abstract":"Developmental theories suggest that cognitive agents develop through an initial sensorimotor stage during which they learn sequential and spatial regularities. We implemented these views in a computer simulation. Following its intrinsic motivations, the agent autonomously learns sensorimotor contingencies and discovers permanent landmarks by which to navigate in the environment. Besides illustrating developmental theories, this model suggests new ways to implement vision and navigation in artificial systems. Specifically, we coupled a sequence learning mechanism with a visual system capable of interpreting composite visual scenes by inhibiting items that are irrelevant to the agent's current motivational state.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125194665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Teaching and executing verb phrases","authors":"D. Hewlett, Thomas J. Walsh, P. Cohen","doi":"10.1109/DEVLRN.2011.6037340","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037340","url":null,"abstract":"This paper describes a framework for an agent to learn models of verb-phrase meanings from human teachers and combine these models with environmental dynamics to enact verb commands. The framework extends prior work in apprenticeship learning and leverages recent advancements in modeling activities and planning in domains with multiple objects. We show how to both learn a verb model as a relational finite state machine and how to turn this model into reward and heuristic functions that can then be composed with an MDP model of an environment. The resulting “combined model” can then be efficiently searched by a planner to enact a verb command in this environment. Our experiments in simulated robot domains show this framework can be used to quickly teach verb commands and improves over the current state of the art method.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129766681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K. Lohan, K. Pitsch, K. Rohlfing, K. Fischer, J. Saunders, H. Lehmann, Chrystopher L. Nehaniv, B. Wrede
{"title":"Contingency allows the robot to spot the tutor and to learn from interaction","authors":"K. Lohan, K. Pitsch, K. Rohlfing, K. Fischer, J. Saunders, H. Lehmann, Chrystopher L. Nehaniv, B. Wrede","doi":"10.1109/DEVLRN.2011.6037341","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037341","url":null,"abstract":"Aiming at artificial system learning from a human tutor elicit tutoring behavior, which we implemented on the robotic platform iCub. For the evaluation of the system with users, we considered a contingency module that is developed to elicit tutoring behavior, which we then evaluate by implementing this module on the robotic platform iCub and within an interaction with the users. For the evaluation of our system, we consider not only the participant's behavior but also the system's log-files as dependent variables (as it was suggested in [15] for the improvement of HRI design). We further applied Sequential Analysis as a qualitative method that provides micro-analytical insights into the sequential structure of the interaction. This way, we are able to investigate a closer interrelationship between robot's and tutor's actions and how they respond to each other. We focus on two cases: In the first case, the system module was reacting to the interaction partner appropriately; in the second case, the contingency module failed to spot the tutor. We found that the contingency module enables the robot to engage in an interaction with the human tutor who orients to the robot's conduct as appropriate and responsive. In contrast, when the robot did not engage in an appropriate responsive interaction, the tutor oriented more towards the object while gazing less at the robot.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128222440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modelling the face-to-face effect: Sensory population dynamics and active vision can contribute to perception of social context","authors":"N. Wilkinson, G. Metta, G. Gredebäck","doi":"10.1109/DEVLRN.2011.6037366","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037366","url":null,"abstract":"We introduce a novel dynamical model for visual attention based on stimulus induced population dynamics in an oscillatory medium, and apply this model to active perception of social content in still images. Making use of images from the newly emerging face-to-face paradigm in social developmental psychology, we show that this model generates patterns of eye movements that exhibit increased frequency of gaze shifts between actors in the social condition, as do infants at 16 months of age. The number of gaze shifts can inform useful levels of classification for the social content in the images, demonstrating a potential role for the dynamics of active perception in social cognition. This adaptive performance does not require any long term changes in structure or information storage. Our results further suggest a potential functional role in selective attention for the spiral wave activity recently observed in primary visual neo-cortex.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":" October","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120827539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Ciancio, L. Zollo, E. Guglielmelli, Daniele Caligiore, G. Baldassarre
{"title":"Hierarchical reinforcement learning and central pattern generators for modeling the development of rhythmic manipulation skills","authors":"A. Ciancio, L. Zollo, E. Guglielmelli, Daniele Caligiore, G. Baldassarre","doi":"10.1109/DEVLRN.2011.6037370","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037370","url":null,"abstract":"The development of manipulation skills is a fundamental process for young primates as it leads them to acquire the capacity to modify the world to their advantage. As other motor skills, manipulation develops on the basis of motor babbling processes which are initially heavily based on the production of rhythmic movements. We propose a computational bio-inspired model to investigate the development of functional rhythmic hand skills from initially unstructured movements. The model is based on a hierarchical reinforcement-learning actor-critic model that searches the parameters of a set of central pattern generators (CPGs) having different degrees of sophistication. The model is tested with a simulated robotic hand engaged in rotating bottle cap-like objects having different shape and size. The results show that the model is capable of developing skills based on different combinations of CPGs so as to suitably manipulate the different objects. Overall, the model shows to be a valuable tool for the study of the development of rhythmic manipulation skills in primates.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127882104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Measuring word learning performance in computational models and infants","authors":"C. Bergmann, L. Boves, L. T. Bosch","doi":"10.1109/DEVLRN.2011.6037354","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037354","url":null,"abstract":"In the present paper we investigate the effect of categorising raw behavioural data or computational model responses. In addition, the effect of averaging over stimuli from potentially different populations is assessed. To this end, we replicate studies on word learning and generalisation abilities using the ACORNS models. Our results show that discrete categories may obscure interesting phenomena in the continuous responses. For example, the finding that learning in the model saturates very early at a uniform high recognition accuracy only holds for categorical representations. Additionally, a large difference in the accuracy for individual words is obscured by averaging over all stimuli. Because different words behaved differently for different speakers, we could not identify a phonetic basis for the differences. Implications and new predictions for infant behaviour are discussed.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":"249 1-2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132350058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reward-driven learning of sensorimotor laws and visual features","authors":"Jens Kleesiek, A. Engel, C. Weber, S. Wermter","doi":"10.1109/DEVLRN.2011.6037358","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037358","url":null,"abstract":"A frequently reoccurring task of humanoid robots is the autonomous navigation towards a goal position. Here we present a simulation of a purely vision-based docking behavior in a 3-D physical world. The robot learns sensorimotor laws and visual features simultaneously and exploits both for navigation towards its virtual target region. The control laws are trained using a two-layer network consisting of a feature (sensory) layer that feeds into an action (Q-value) layer. A reinforcement feedback signal (delta) modulates not only the action but at the same time the feature weights. Under this influence, the network learns interpretable visual features and assigns goal-directed actions successfully. This is a step towards investigating how reinforcement learning can be linked to visual perception.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114781488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Joint development of disparity tuning and vergence control","authors":"Wanting Sun, Bertram E. Shi","doi":"10.1109/DEVLRN.2011.6037338","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037338","url":null,"abstract":"Behavior and sensory perception are mutually dependent. Sensory perception drives behavior, but behavior can also influence the development of sensory perception, by altering the statistics of the sensory input. Thus, there is a “chicken-and-egg” problem as to which arises first. We propose here a solution to this problem in the context of the neural processing of binocular disparity and the behavioral control of binocular vergence to maintain fixation. We show that it is possible for both the neural processing and the control policy to develop simultaneously. In particular, we assume that the neural processing develops through learning a sparse complex-cell representation of the input, and that the control policy simultaneously develops through reinforcement learning to maximize the activity in this complex cell representation. These processes are coupled. The control policy determines the statistics of the input, which determines the sparse coding that develops, which in turn determines the reward maximized by the control policy. Our experiments show that both disparity selective binocular receptive fields and a successful binocular fixation policy develop. Our results underline the importance of behavior, as we show that on the same input but in the absence of learned behavior, much fewer disparity selective binocular receptive fields develop.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115526984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bootstrapping intrinsically motivated learning with human demonstration","authors":"S. Nguyen, Adrien Baranes, Pierre-Yves Oudeyer","doi":"10.1109/DEVLRN.2011.6037329","DOIUrl":"https://doi.org/10.1109/DEVLRN.2011.6037329","url":null,"abstract":"This paper studies the coupling of internally guided learning and social interaction, and more specifically the improvement owing to demonstrations of the learning by intrinsic motivation. We present Socially Guided Intrinsic Motivation by Demonstration (SGIM-D), an algorithm for learning in continuous, unbounded and non-preset environments. After introducing social learning and intrinsic motivation, we describe the design of our algorithm, before showing through a fishing experiment that SGIM-D efficiently combines the advantages of social learning and intrinsic motivation to gain a wide repertoire while being specialised in specific subspaces.","PeriodicalId":256921,"journal":{"name":"2011 IEEE International Conference on Development and Learning (ICDL)","volume":"191 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122009646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}