Machine learning enabled uncertainty set for data-driven robust optimization

IF 3.9 2区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

Journal of Process Control Pub Date : 2024-12-01 DOI:10.1016/j.jprocont.2024.103339

Yun Li , Neil Yorke-Smith , Tamas Keviczky

{"title":"Machine learning enabled uncertainty set for data-driven robust optimization","authors":"Yun Li , Neil Yorke-Smith , Tamas Keviczky","doi":"10.1016/j.jprocont.2024.103339","DOIUrl":null,"url":null,"abstract":"<div><div>The way how the uncertainties are represented by sets plays a vital role in the performance of robust optimization (RO). This paper presents a novel approach leveraging machine learning (ML) techniques to construct data-driven uncertainty sets from historical uncertainty data for RO problems. The proposed method integrates Density-Based Spatial Clustering of Applications with Noise (DBSCAN), Gaussian Mixture Model (GMM), and Principle Component Analysis (PCA) systematically to eliminate the influence of uncertainty scenarios with low occurrence probability and generate a nonconvex uncertainty set that is a union of multiple basic subsets (box or ellipsoid) without sacrificing its computational tractability. In addition to presenting a comprehensive algorithm for uncertainty set development, this paper offers detailed guidelines for parameter tuning and performance analysis. By harnessing the well-established ML packages <span>scikit-learn</span>, a Python-based toolkit for implementing the proposed approach is also provided. Furthermore, a computationally efficient solution for a two-stage linear RO problem with the proposed data-driven uncertainty set is derived, alongside establishing a probabilistic guarantee of constraint satisfaction for out-of-sample uncertainties. Extensive numerical experiments, conducted on both synthetic and real-world datasets as well as an optimization-based control problem, are performed to demonstrate the efficacy of the proposed methodology.</div></div>","PeriodicalId":50079,"journal":{"name":"Journal of Process Control","volume":"144 ","pages":"Article 103339"},"PeriodicalIF":3.9000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Process Control","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0959152424001793","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

The way how the uncertainties are represented by sets plays a vital role in the performance of robust optimization (RO). This paper presents a novel approach leveraging machine learning (ML) techniques to construct data-driven uncertainty sets from historical uncertainty data for RO problems. The proposed method integrates Density-Based Spatial Clustering of Applications with Noise (DBSCAN), Gaussian Mixture Model (GMM), and Principle Component Analysis (PCA) systematically to eliminate the influence of uncertainty scenarios with low occurrence probability and generate a nonconvex uncertainty set that is a union of multiple basic subsets (box or ellipsoid) without sacrificing its computational tractability. In addition to presenting a comprehensive algorithm for uncertainty set development, this paper offers detailed guidelines for parameter tuning and performance analysis. By harnessing the well-established ML packages scikit-learn, a Python-based toolkit for implementing the proposed approach is also provided. Furthermore, a computationally efficient solution for a two-stage linear RO problem with the proposed data-driven uncertainty set is derived, alongside establishing a probabilistic guarantee of constraint satisfaction for out-of-sample uncertainties. Extensive numerical experiments, conducted on both synthetic and real-world datasets as well as an optimization-based control problem, are performed to demonstrate the efficacy of the proposed methodology.

查看原文本刊更多论文

机器学习实现了数据驱动鲁棒优化的不确定性集

如何用集合表示不确定性对鲁棒优化（RO）的性能起着至关重要的作用。本文提出了一种利用机器学习（ML）技术从RO问题的历史不确定性数据构建数据驱动的不确定性集的新方法。该方法将基于密度的带噪声应用空间聚类（DBSCAN）、高斯混合模型（GMM）和主成分分析（PCA）相结合，在不牺牲计算可跟踪性的前提下，消除低概率不确定性场景的影响，生成由多个基本子集（盒或椭球）组成的非凸不确定性集。除了提出一种综合的不确定性集开发算法外，本文还提供了参数调优和性能分析的详细指南。通过利用完善的ML包scikit-learn，还提供了用于实现所建议方法的基于python的工具包。此外，推导了具有数据驱动不确定性集的两阶段线性RO问题的计算效率解，并建立了样本外不确定性约束满足的概率保证。在合成和真实数据集以及基于优化的控制问题上进行了广泛的数值实验，以证明所提出方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Process Control 工程技术-工程：化工

CiteScore

7.00

自引率

11.90%

发文量

159

审稿时长

74 days

期刊介绍： This international journal covers the application of control theory, operations research, computer science and engineering principles to the solution of process control problems. In addition to the traditional chemical processing and manufacturing applications, the scope of process control problems involves a wide range of applications that includes energy processes, nano-technology, systems biology, bio-medical engineering, pharmaceutical processing technology, energy storage and conversion, smart grid, and data analytics among others. Papers on the theory in these areas will also be accepted provided the theoretical contribution is aimed at the application and the development of process control techniques. Topics covered include: • Control applications• Process monitoring• Plant-wide control• Process control systems• Control techniques and algorithms• Process modelling and simulation• Design methods Advanced design methods exclude well established and widely studied traditional design techniques such as PID tuning and its many variants. Applications in fields such as control of automotive engines, machinery and robotics are not deemed suitable unless a clear motivation for the relevance to process control is provided.