{"title":"Vision intelligence-conditioned reinforcement learning for precision assembly","authors":"Sichao Liu, Lihui Wang (1)","doi":"10.1016/j.cirp.2025.04.016","DOIUrl":null,"url":null,"abstract":"<div><div>Robots that embrace human-level performance on precise, dexterous and dynamic assembly tasks can significantly enhance the efficiency in precision assembly but remain big challenges. This paper introduces a vision intelligence-conditioned method for precision assembly, enabled by human-in-the-loop reinforcement learning. Upon visual demonstrations collected and trained by a reward classifier, a data-efficient reinforcement learning algorithm trains and learns vision-based robotic manipulation policies under human-in-the-loop corrections. An impedance-based control strategy derived from policies and visual guidance achieves high-precision contact-rich assembly manipulations with near-perfect success rates (above 98%) and compliance behaviours. The effectiveness of the presented method is experimentally demonstrated with semiconductor assembly.</div></div>","PeriodicalId":55256,"journal":{"name":"Cirp Annals-Manufacturing Technology","volume":"74 1","pages":"Pages 13-17"},"PeriodicalIF":3.2000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cirp Annals-Manufacturing Technology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0007850625000642","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
Robots that embrace human-level performance on precise, dexterous and dynamic assembly tasks can significantly enhance the efficiency in precision assembly but remain big challenges. This paper introduces a vision intelligence-conditioned method for precision assembly, enabled by human-in-the-loop reinforcement learning. Upon visual demonstrations collected and trained by a reward classifier, a data-efficient reinforcement learning algorithm trains and learns vision-based robotic manipulation policies under human-in-the-loop corrections. An impedance-based control strategy derived from policies and visual guidance achieves high-precision contact-rich assembly manipulations with near-perfect success rates (above 98%) and compliance behaviours. The effectiveness of the presented method is experimentally demonstrated with semiconductor assembly.
期刊介绍:
CIRP, The International Academy for Production Engineering, was founded in 1951 to promote, by scientific research, the development of all aspects of manufacturing technology covering the optimization, control and management of processes, machines and systems.
This biannual ISI cited journal contains approximately 140 refereed technical and keynote papers. Subject areas covered include:
Assembly, Cutting, Design, Electro-Physical and Chemical Processes, Forming, Abrasive processes, Surfaces, Machines, Production Systems and Organizations, Precision Engineering and Metrology, Life-Cycle Engineering, Microsystems Technology (MST), Nanotechnology.