{"title":"iREbikeLANCE:基于历史使用数据的优化共享单车分配的强化学习代理","authors":"Igor Betkier","doi":"10.1016/j.softx.2025.102387","DOIUrl":null,"url":null,"abstract":"<div><div>This paper introduces iREbikeLANCE, a comprehensive reinforcement learning platform consisting of 2,326 lines of original Python backend code and an interactive web interface. The platform features a custom bike-sharing simulation environment, asynchronous training pipeline, and novel reward engineering interface. iREbikeLANCE integrates real-world BSS data from Warsaw's Veturilo system with a Proximal Policy Optimization (PPO) agent from the Stable Baselines3 library. Its core novelty lies in providing users with an intuitive interface to interactively define and tune multi-component reward function weights for the RL agent. The platform allows users to configure simulation parameters, initiate training, monitor progress via real-time logs and metrics, visualize station states on a map, compare agent performance against baselines, and download resulting models. By abstracting away underlying coding complexity, iREbikeLANCE empowers researchers, students, and practitioners to explore different rebalancing heuristics, understand the impact of reward engineering on agent behavior, and accelerate the development of more efficient and adaptive BSS operations.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"32 ","pages":"Article 102387"},"PeriodicalIF":2.4000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"iREbikeLANCE: Reinforcement learning agent for optimal bike-sharing distribution powered by historical usage data\",\"authors\":\"Igor Betkier\",\"doi\":\"10.1016/j.softx.2025.102387\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper introduces iREbikeLANCE, a comprehensive reinforcement learning platform consisting of 2,326 lines of original Python backend code and an interactive web interface. The platform features a custom bike-sharing simulation environment, asynchronous training pipeline, and novel reward engineering interface. iREbikeLANCE integrates real-world BSS data from Warsaw's Veturilo system with a Proximal Policy Optimization (PPO) agent from the Stable Baselines3 library. Its core novelty lies in providing users with an intuitive interface to interactively define and tune multi-component reward function weights for the RL agent. The platform allows users to configure simulation parameters, initiate training, monitor progress via real-time logs and metrics, visualize station states on a map, compare agent performance against baselines, and download resulting models. By abstracting away underlying coding complexity, iREbikeLANCE empowers researchers, students, and practitioners to explore different rebalancing heuristics, understand the impact of reward engineering on agent behavior, and accelerate the development of more efficient and adaptive BSS operations.</div></div>\",\"PeriodicalId\":21905,\"journal\":{\"name\":\"SoftwareX\",\"volume\":\"32 \",\"pages\":\"Article 102387\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SoftwareX\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S235271102500353X\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SoftwareX","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S235271102500353X","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
iREbikeLANCE: Reinforcement learning agent for optimal bike-sharing distribution powered by historical usage data
This paper introduces iREbikeLANCE, a comprehensive reinforcement learning platform consisting of 2,326 lines of original Python backend code and an interactive web interface. The platform features a custom bike-sharing simulation environment, asynchronous training pipeline, and novel reward engineering interface. iREbikeLANCE integrates real-world BSS data from Warsaw's Veturilo system with a Proximal Policy Optimization (PPO) agent from the Stable Baselines3 library. Its core novelty lies in providing users with an intuitive interface to interactively define and tune multi-component reward function weights for the RL agent. The platform allows users to configure simulation parameters, initiate training, monitor progress via real-time logs and metrics, visualize station states on a map, compare agent performance against baselines, and download resulting models. By abstracting away underlying coding complexity, iREbikeLANCE empowers researchers, students, and practitioners to explore different rebalancing heuristics, understand the impact of reward engineering on agent behavior, and accelerate the development of more efficient and adaptive BSS operations.
期刊介绍:
SoftwareX aims to acknowledge the impact of software on today''s research practice, and on new scientific discoveries in almost all research domains. SoftwareX also aims to stress the importance of the software developers who are, in part, responsible for this impact. To this end, SoftwareX aims to support publication of research software in such a way that: The software is given a stamp of scientific relevance, and provided with a peer-reviewed recognition of scientific impact; The software developers are given the credits they deserve; The software is citable, allowing traditional metrics of scientific excellence to apply; The academic career paths of software developers are supported rather than hindered; The software is publicly available for inspection, validation, and re-use. Above all, SoftwareX aims to inform researchers about software applications, tools and libraries with a (proven) potential to impact the process of scientific discovery in various domains. The journal is multidisciplinary and accepts submissions from within and across subject domains such as those represented within the broad thematic areas below: Mathematical and Physical Sciences; Environmental Sciences; Medical and Biological Sciences; Humanities, Arts and Social Sciences. Originating from these broad thematic areas, the journal also welcomes submissions of software that works in cross cutting thematic areas, such as citizen science, cybersecurity, digital economy, energy, global resource stewardship, health and wellbeing, etcetera. SoftwareX specifically aims to accept submissions representing domain-independent software that may impact more than one research domain.