Latest publications from Proceedings of the ... International Conference on Machine Learning

Continuation Path Learning for Homotopy Optimization
Xi Lin, Zhiyuan Yang, Xiao-Yan Zhang, Qingfu Zhang
{"title":"Continuation Path Learning for Homotopy Optimization","authors":"Xi Lin, Zhiyuan Yang, Xiao-Yan Zhang, Qingfu Zhang","doi":"10.48550/arXiv.2307.12551","DOIUrl":"https://doi.org/10.48550/arXiv.2307.12551","url":null,"abstract":"Homotopy optimization is a traditional method to deal with a complicated optimization problem by solving a sequence of easy-to-hard surrogate subproblems. However, this method can be very sensitive to the continuation schedule design and might lead to a suboptimal solution to the original problem. In addition, the intermediate solutions, often ignored by classic homotopy optimization, could be useful for many real-world applications. In this work, we propose a novel model-based approach to learn the whole continuation path for homotopy optimization, which contains infinite intermediate solutions for any surrogate subproblems. Rather than the classic unidirectional easy-to-hard optimization, our method can simultaneously optimize the original problem and all surrogate subproblems in a collaborative manner. The proposed model also supports real-time generation of any intermediate solution, which could be desirable for many applications. Experimental studies on different problems show that our proposed method can significantly improve the performance of homotopy optimization and provide extra helpful information to support better decision-making.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"65 1","pages":"21288-21311"},"PeriodicalIF":0.0,"publicationDate":"2023-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82599543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
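For context on the baseline being improved, here is a minimal numpy sketch of classic continuation: solve a sequence of surrogates f(x, t) from easy (t = 1) to hard (t = 0), warm-starting each subproblem from the previous solution. The toy objective, schedule, and step sizes are all illustrative assumptions; the paper's learned continuation-path model is not reproduced here.

```python
import numpy as np

def grad(f, x, t, eps=1e-6):
    # central-difference numerical gradient of f(., t) at x
    g = np.zeros_like(x)
    for i in range(x.size):
        e = np.zeros_like(x); e[i] = eps
        g[i] = (f(x + e, t) - f(x - e, t)) / (2 * eps)
    return g

def homotopy_optimize(f, x0, ts, steps=200, lr=1e-2):
    """Classic continuation: solve a sequence of surrogates f(., t),
    warm-starting each from the previous solution."""
    x, path = x0, []
    for t in ts:                      # easy (t = 1) -> hard (t = 0)
        for _ in range(steps):
            x = x - lr * grad(f, x, t)
        path.append((t, x.copy()))    # the intermediate solutions the paper keeps
    return path

# toy example: t blends a non-convex objective with a convex surrogate
f = lambda x, t: (1 - t) * np.sum(np.sin(3 * x) + 0.1 * x**2) + t * np.sum(x**2)
path = homotopy_optimize(f, x0=np.ones(2), ts=np.linspace(1.0, 0.0, 6))
```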
On the Effectiveness of Offline RL for Dialogue Response Generation
Paloma Sodhi, Felix Wu, Ethan R. Elenberg, Kilian Q. Weinberger, Ryan T. McDonald
{"title":"On the Effectiveness of Offline RL for Dialogue Response Generation","authors":"Paloma Sodhi, Felix Wu, Ethan R. Elenberg, Kilian Q. Weinberger, Ryan T. McDonald","doi":"10.48550/arXiv.2307.12425","DOIUrl":"https://doi.org/10.48550/arXiv.2307.12425","url":null,"abstract":"A common training technique for language models is teacher forcing (TF). TF attempts to match human language exactly, even though identical meanings can be expressed in different ways. This motivates use of sequence-level objectives for dialogue response generation. In this paper, we study the efficacy of various offline reinforcement learning (RL) methods to maximize such objectives. We present a comprehensive evaluation across multiple datasets, models, and metrics. Offline RL shows a clear performance improvement over teacher forcing while not inducing training instability or sacrificing practical training budgets.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"3 1","pages":"32088-32104"},"PeriodicalIF":0.0,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90876190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
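To make the objective distinction concrete, the PyTorch sketch below contrasts a token-level teacher-forcing loss with a generic REINFORCE-style sequence-level surrogate. This only illustrates what optimizing a sequence-level objective looks like; the shapes, sampled responses, and scalar reward are placeholders, and the paper's actual offline RL methods are not reproduced.

```python
import torch
import torch.nn.functional as F

def teacher_forcing_loss(logits, targets):
    # logits: (batch, seq, vocab); targets: (batch, seq) of token ids
    return F.cross_entropy(logits.reshape(-1, logits.size(-1)), targets.reshape(-1))

def sequence_level_loss(logits, sampled, reward):
    # REINFORCE-style surrogate: raise log-prob of sampled responses,
    # weighted by a sequence-level reward (e.g. a learned scorer).
    logp = F.log_softmax(logits, dim=-1)
    logp_tok = logp.gather(-1, sampled.unsqueeze(-1)).squeeze(-1)  # (batch, seq)
    return -(reward.unsqueeze(-1) * logp_tok).mean()

batch, seq, vocab = 4, 8, 100
logits = torch.randn(batch, seq, vocab, requires_grad=True)
targets = torch.randint(vocab, (batch, seq))   # human reference tokens
sampled = torch.randint(vocab, (batch, seq))   # model-sampled response tokens
reward = torch.rand(batch)                     # placeholder sequence rewards
tf_loss = teacher_forcing_loss(logits, targets)
rl_loss = sequence_level_loss(logits, sampled, reward)
```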
Model-based Offline Reinforcement Learning with Count-based Conservatism
Byeongchang Kim, Min-hwan Oh
{"title":"Model-based Offline Reinforcement Learning with Count-based Conservatism","authors":"Byeongchang Kim, Min-hwan Oh","doi":"10.48550/arXiv.2307.11352","DOIUrl":"https://doi.org/10.48550/arXiv.2307.11352","url":null,"abstract":"In this paper, we propose a model-based offline reinforcement learning method that integrates count-based conservatism, named $texttt{Count-MORL}$. Our method utilizes the count estimates of state-action pairs to quantify model estimation error, marking the first algorithm of demonstrating the efficacy of count-based conservatism in model-based offline deep RL to the best of our knowledge. For our proposed method, we first show that the estimation error is inversely proportional to the frequency of state-action pairs. Secondly, we demonstrate that the learned policy under the count-based conservative model offers near-optimality performance guarantees. Through extensive numerical experiments, we validate that $texttt{Count-MORL}$ with hash code implementation significantly outperforms existing offline RL algorithms on the D4RL benchmark datasets. The code is accessible at $href{https://github.com/oh-lab/Count-MORL}{https://github.com/oh-lab/Count-MORL}$.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"38 1","pages":"16728-16746"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86091968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
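The core mechanism is easy to sketch: keep counts of (state, action) pairs seen in the offline dataset and subtract a reward penalty that shrinks as the count grows. The minimal Python below uses a plain discretization dictionary where the paper uses learned hash codes; the penalty form beta/sqrt(n) is a common convention and an assumption here.

```python
from collections import Counter
import numpy as np

class CountConservativeModel:
    """Toy count-based conservatism: penalize model rewards for
    rarely-seen (state, action) pairs."""
    def __init__(self, beta=1.0, bins=10):
        self.counts = Counter()
        self.beta, self.bins = beta, bins

    def _key(self, s, a):
        # crude discretization stand-in for the paper's hash codes
        return (tuple(np.floor(np.asarray(s) * self.bins).astype(int)), int(a))

    def update(self, s, a):
        self.counts[self._key(s, a)] += 1

    def penalized_reward(self, s, a, r_model):
        n = self.counts[self._key(s, a)]
        bonus = self.beta / np.sqrt(n) if n > 0 else self.beta  # max penalty when unseen
        return r_model - bonus
```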
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Zhiao Huang, Litian Liang, Z. Ling, Xuanlin Li, Chuang Gan, H. Su
{"title":"Reparameterized Policy Learning for Multimodal Trajectory Optimization","authors":"Zhiao Huang, Litian Liang, Z. Ling, Xuanlin Li, Chuang Gan, H. Su","doi":"10.48550/arXiv.2307.10710","DOIUrl":"https://doi.org/10.48550/arXiv.2307.10710","url":null,"abstract":"We investigate the challenge of parametrizing policies for reinforcement learning (RL) in high-dimensional continuous action spaces. Our objective is to develop a multimodal policy that overcomes limitations inherent in the commonly-used Gaussian parameterization. To achieve this, we propose a principled framework that models the continuous RL policy as a generative model of optimal trajectories. By conditioning the policy on a latent variable, we derive a novel variational bound as the optimization objective, which promotes exploration of the environment. We then present a practical model-based RL method, called Reparameterized Policy Gradient (RPG), which leverages the multimodal policy parameterization and learned world model to achieve strong exploration capabilities and high data efficiency. Empirical results demonstrate that our method can help agents evade local optima in tasks with dense rewards and solve challenging sparse-reward environments by incorporating an object-centric intrinsic reward. Our method consistently outperforms previous approaches across a range of tasks. Code and supplementary materials are available on the project page https://haosulab.github.io/RPG/","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"23 1","pages":"13957-13975"},"PeriodicalIF":0.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87664226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
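A simple way to see how a latent variable yields a multimodal policy: condition a Gaussian head on a sampled discrete code z, so that the marginal over z is a mixture. The PyTorch sketch below shows only this parameterization idea; the network sizes and the discrete latent are assumptions, and RPG's variational objective and world model are not included.

```python
import torch
import torch.nn as nn

class LatentPolicy(nn.Module):
    """Gaussian policy conditioned on a discrete latent z, so the
    marginal over z is multimodal (a sketch, not the RPG algorithm)."""
    def __init__(self, obs_dim, act_dim, n_modes=4, hidden=64):
        super().__init__()
        self.n_modes = n_modes
        self.net = nn.Sequential(
            nn.Linear(obs_dim + n_modes, hidden), nn.ReLU(),
            nn.Linear(hidden, 2 * act_dim))

    def forward(self, obs):
        z = torch.randint(self.n_modes, (obs.size(0),))          # sample a mode
        z_onehot = torch.nn.functional.one_hot(z, self.n_modes).float()
        mu, log_std = self.net(torch.cat([obs, z_onehot], -1)).chunk(2, -1)
        return torch.distributions.Normal(mu, log_std.exp()).sample()

policy = LatentPolicy(obs_dim=3, act_dim=2)
actions = policy(torch.randn(5, 3))   # different z draws land in different modes
```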
Private Federated Learning with Autotuned Compression
Enayat Ullah, Christopher A. Choquette-Choo, P. Kairouz, Sewoong Oh
{"title":"Private Federated Learning with Autotuned Compression","authors":"Enayat Ullah, Christopher A. Choquette-Choo, P. Kairouz, Sewoong Oh","doi":"10.48550/arXiv.2307.10999","DOIUrl":"https://doi.org/10.48550/arXiv.2307.10999","url":null,"abstract":"We propose new techniques for reducing communication in private federated learning without the need for setting or tuning compression rates. Our on-the-fly methods automatically adjust the compression rate based on the error induced during training, while maintaining provable privacy guarantees through the use of secure aggregation and differential privacy. Our techniques are provably instance-optimal for mean estimation, meaning that they can adapt to the ``hardness of the problem\"with minimal interactivity. We demonstrate the effectiveness of our approach on real-world datasets by achieving favorable compression rates without the need for tuning.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"68 1","pages":"34668-34708"},"PeriodicalIF":0.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80066362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
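The on-the-fly idea can be sketched in a few lines: quantize each update, measure the error actually induced, and nudge the bit-width toward a target error. Everything below (stochastic uniform quantization, the target, the bit-width bounds) is an illustrative assumption, and the secure-aggregation and differential-privacy components are omitted.

```python
import numpy as np

def quantize(v, bits, rng):
    # uniform stochastic quantization of v onto 2**bits levels (unbiased rounding)
    lo, hi = v.min(), v.max()
    levels = 2 ** bits - 1
    scaled = (v - lo) / max(hi - lo, 1e-12) * levels
    q = np.floor(scaled + rng.random(v.shape))
    return lo + q / levels * max(hi - lo, 1e-12)

def autotuned_round(updates, target_err=0.05, bits=8, rng=np.random.default_rng(0)):
    """Toy autotuning loop: shrink or grow the bit-width so the relative
    quantization error tracks a target."""
    out = []
    for v in updates:
        qv = quantize(v, bits, rng)
        err = np.linalg.norm(qv - v) / (np.linalg.norm(v) + 1e-12)
        # adapt: compress harder when error is comfortably low, back off when high
        bits = max(1, bits - 1) if err < target_err / 2 else min(16, bits + 1) if err > target_err else bits
        out.append(qv)
    return out, bits

grads = [np.random.default_rng(i).normal(size=256) for i in range(5)]
compressed, final_bits = autotuned_round(grads)
```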
Fractional Denoising for 3D Molecular Pre-training
Shi Feng, Yuyan Ni, Yanyan Lan, Zhiming Ma, Wei-Ying Ma
{"title":"Fractional Denoising for 3D Molecular Pre-training","authors":"Shi Feng, Yuyan Ni, Yanyan Lan, Zhiming Ma, Wei-Ying Ma","doi":"10.48550/arXiv.2307.10683","DOIUrl":"https://doi.org/10.48550/arXiv.2307.10683","url":null,"abstract":"Coordinate denoising is a promising 3D molecular pre-training method, which has achieved remarkable performance in various downstream drug discovery tasks. Theoretically, the objective is equivalent to learning the force field, which is revealed helpful for downstream tasks. Nevertheless, there are two challenges for coordinate denoising to learn an effective force field, i.e. low coverage samples and isotropic force field. The underlying reason is that molecular distributions assumed by existing denoising methods fail to capture the anisotropic characteristic of molecules. To tackle these challenges, we propose a novel hybrid noise strategy, including noises on both dihedral angel and coordinate. However, denoising such hybrid noise in a traditional way is no more equivalent to learning the force field. Through theoretical deductions, we find that the problem is caused by the dependency of the input conformation for covariance. To this end, we propose to decouple the two types of noise and design a novel fractional denoising method (Frad), which only denoises the latter coordinate part. In this way, Frad enjoys both the merits of sampling more low-energy structures and the force field equivalence. Extensive experiments show the effectiveness of Frad in molecular representation, with a new state-of-the-art on 9 out of 12 tasks of QM9 and on 7 out of 8 targets of MD17.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"14 1","pages":"9938-9961"},"PeriodicalIF":0.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89062630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 3
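A hedged sketch of one fractional-denoising training step: perturb a conformation with both a torsion-level change and Gaussian coordinate noise, then regress only the coordinate noise. The `torsion_perturb` callable below is a stand-in (a real dihedral perturbation needs molecular topology), and the tiny MLP ignores the equivariant architectures actually used for molecules.

```python
import torch
import torch.nn as nn

def frad_step(model, coords, torsion_perturb, sigma=0.1):
    """One toy fractional-denoising step: hybrid noise in, but the
    regression target is the coordinate noise only."""
    noisy = torsion_perturb(coords)           # hybrid part 1: dihedral-level noise
    eps = sigma * torch.randn_like(noisy)     # hybrid part 2: coordinate noise
    pred = model(noisy + eps)                 # network sees the doubly-noised coords
    return ((pred - eps) ** 2).mean()         # denoise the coordinate part only

model = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, 3))
coords = torch.randn(20, 3)                  # 20 atoms, xyz positions
loss = frad_step(model, coords,
                 torsion_perturb=lambda x: x + 0.05 * torch.randn_like(x))
```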
From Adaptive Query Release to Machine Unlearning
Enayat Ullah, R. Arora
{"title":"From Adaptive Query Release to Machine Unlearning","authors":"Enayat Ullah, R. Arora","doi":"10.48550/arXiv.2307.11228","DOIUrl":"https://doi.org/10.48550/arXiv.2307.11228","url":null,"abstract":"We formalize the problem of machine unlearning as design of efficient unlearning algorithms corresponding to learning algorithms which perform a selection of adaptive queries from structured query classes. We give efficient unlearning algorithms for linear and prefix-sum query classes. As applications, we show that unlearning in many problems, in particular, stochastic convex optimization (SCO), can be reduced to the above, yielding improved guarantees for the problem. In particular, for smooth Lipschitz losses and any $rho>0$, our results yield an unlearning algorithm with excess population risk of $tilde Obig(frac{1}{sqrt{n}}+frac{sqrt{d}}{nrho}big)$ with unlearning query (gradient) complexity $tilde O(rho cdot text{Retraining Complexity})$, where $d$ is the model dimensionality and $n$ is the initial number of samples. For non-smooth Lipschitz losses, we give an unlearning algorithm with excess population risk $tilde Obig(frac{1}{sqrt{n}}+big(frac{sqrt{d}}{nrho}big)^{1/2}big)$ with the same unlearning query (gradient) complexity. Furthermore, in the special case of Generalized Linear Models (GLMs), such as those in linear and logistic regression, we get dimension-independent rates of $tilde Obig(frac{1}{sqrt{n}} +frac{1}{(nrho)^{2/3}}big)$ and $tilde Obig(frac{1}{sqrt{n}} +frac{1}{(nrho)^{1/3}}big)$ for smooth Lipschitz and non-smooth Lipschitz losses respectively. Finally, we give generalizations of the above from one unlearning request to textit{dynamic} streams consisting of insertions and deletions.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"11 1","pages":"34642-34667"},"PeriodicalIF":0.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82605525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
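The prefix-sum query class makes the reduction tangible: if the "model" state is a running prefix sum of per-sample statistics, deleting sample i only requires correcting the affected suffix rather than recomputing from scratch. The toy class below illustrates just this query-class view; it is not the paper's SCO unlearning algorithm.

```python
import numpy as np

class PrefixSumLearner:
    """Toy 'learning as prefix-sum queries': the state is the prefix
    sums of per-sample statistics, so unlearning sample i touches only
    the suffix of sums (a sketch of the query-class view only)."""
    def __init__(self, stats):
        self.stats = list(stats)
        self.prefix = np.cumsum(self.stats, axis=0)

    def unlearn(self, i):
        contribution = self.stats.pop(i)
        self.prefix = np.delete(self.prefix, i, axis=0)  # drop sample i's entry
        self.prefix[i:] -= contribution                  # fix only the affected suffix

learner = PrefixSumLearner([1.0, 2.0, 3.0, 4.0])
learner.unlearn(1)    # prefix sums become [1., 4., 8.], matching retraining on [1, 3, 4]
```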
Sequential Multi-Dimensional Self-Supervised Learning for Clinical Time Series
Aniruddh Raghu, P. Chandak, Ridwan Alam, John Guttag, Collin M. Stultz
{"title":"Sequential Multi-Dimensional Self-Supervised Learning for Clinical Time Series","authors":"Aniruddh Raghu, P. Chandak, Ridwan Alam, John Guttag, Collin M. Stultz","doi":"10.48550/arXiv.2307.10923","DOIUrl":"https://doi.org/10.48550/arXiv.2307.10923","url":null,"abstract":"Self-supervised learning (SSL) for clinical time series data has received significant attention in recent literature, since these data are highly rich and provide important information about a patient's physiological state. However, most existing SSL methods for clinical time series are limited in that they are designed for unimodal time series, such as a sequence of structured features (e.g., lab values and vitals signs) or an individual high-dimensional physiological signal (e.g., an electrocardiogram). These existing methods cannot be readily extended to model time series that exhibit multimodality, with structured features and high-dimensional data being recorded at each timestep in the sequence. In this work, we address this gap and propose a new SSL method -- Sequential Multi-Dimensional SSL -- where a SSL loss is applied both at the level of the entire sequence and at the level of the individual high-dimensional data points in the sequence in order to better capture information at both scales. Our strategy is agnostic to the specific form of loss function used at each level -- it can be contrastive, as in SimCLR, or non-contrastive, as in VICReg. We evaluate our method on two real-world clinical datasets, where the time series contains sequences of (1) high-frequency electrocardiograms and (2) structured data from lab values and vitals signs. Our experimental results indicate that pre-training with our method and then fine-tuning on downstream tasks improves performance over baselines on both datasets, and in several settings, can lead to improvements across different self-supervised loss functions.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"68 1","pages":"28531-28548"},"PeriodicalIF":0.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89500786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
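The two-scale loss structure can be sketched directly: apply one SSL loss to per-timestep embeddings and another to pooled sequence embeddings, then mix them. The sketch below uses InfoNCE at both levels for concreteness (the paper is agnostic to the choice); the encoders, shapes, and mixing weight are assumptions.

```python
import torch
import torch.nn.functional as F

def info_nce(z1, z2, tau=0.1):
    # standard InfoNCE between two aligned batches of embeddings
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    logits = z1 @ z2.t() / tau
    return F.cross_entropy(logits, torch.arange(z1.size(0)))

def two_level_loss(enc_step, enc_seq, x1, x2, lam=0.5):
    """Sketch of the two-scale idea: one SSL loss on per-timestep
    embeddings, one on pooled sequence embeddings. x1, x2 are two
    augmented views of shape (batch, seq, feat)."""
    b, s, _ = x1.shape
    h1, h2 = enc_step(x1), enc_step(x2)                         # (batch, seq, d)
    step_loss = info_nce(h1.reshape(b * s, -1), h2.reshape(b * s, -1))
    seq_loss = info_nce(enc_seq(h1.mean(1)), enc_seq(h2.mean(1)))
    return lam * step_loss + (1 - lam) * seq_loss

enc_step = torch.nn.Linear(16, 32)   # stand-in per-timestep encoder
enc_seq = torch.nn.Linear(32, 32)    # stand-in sequence-level head
x1, x2 = torch.randn(4, 10, 16), torch.randn(4, 10, 16)
loss = two_level_loss(enc_step, enc_seq, x1, x2)
```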
Contextual Reliability: When Different Features Matter in Different Contexts
Gaurav R. Ghosal, Amrith Rajagopal Setlur, Daniel S. Brown, A. Dragan, Aditi Raghunathan
{"title":"Contextual Reliability: When Different Features Matter in Different Contexts","authors":"Gaurav R. Ghosal, Amrith Rajagopal Setlur, Daniel S. Brown, A. Dragan, Aditi Raghunathan","doi":"10.48550/arXiv.2307.10026","DOIUrl":"https://doi.org/10.48550/arXiv.2307.10026","url":null,"abstract":"Deep neural networks often fail catastrophically by relying on spurious correlations. Most prior work assumes a clear dichotomy into spurious and reliable features; however, this is often unrealistic. For example, most of the time we do not want an autonomous car to simply copy the speed of surrounding cars -- we don't want our car to run a red light if a neighboring car does so. However, we cannot simply enforce invariance to next-lane speed, since it could provide valuable information about an unobservable pedestrian at a crosswalk. Thus, universally ignoring features that are sometimes (but not always) reliable can lead to non-robust performance. We formalize a new setting called contextual reliability which accounts for the fact that the\"right\"features to use may vary depending on the context. We propose and analyze a two-stage framework called Explicit Non-spurious feature Prediction (ENP) which first identifies the relevant features to use for a given context, then trains a model to rely exclusively on these features. Our work theoretically and empirically demonstrates the advantages of ENP over existing methods and provides new benchmarks for contextual reliability.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"173 1","pages":"11300-11320"},"PeriodicalIF":0.0,"publicationDate":"2023-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74155802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
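With hand-specified feature masks, the two-stage structure of ENP reduces to a few lines: pick the features deemed reliable for each example's context, zero out the rest, and fit on the masked inputs. In the sketch below the stage-one masks are given rather than learned, which is the part the paper actually contributes; the data, contexts, and masks are all toy assumptions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def enp_fit(X, y, contexts, masks):
    """Two-stage sketch with hand-given masks: keep only the features
    deemed reliable in each example's context, then fit a predictor."""
    X_masked = X * np.stack([masks[c] for c in contexts])  # zero unreliable features
    return LogisticRegression(max_iter=1000).fit(X_masked, y)

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
contexts = rng.integers(0, 2, size=200)        # e.g. "crosswalk" vs "open highway"
masks = {0: np.array([1, 1, 0, 0.]),           # which features count in each context
         1: np.array([1, 0, 1, 1.])}
y = (X[:, 0] > 0).astype(int)
model = enp_fit(X, y, contexts, masks)
```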
Convex Geometry of ReLU-layers, Injectivity on the Ball and Local Reconstruction
Daniel Haider, M. Ehler, P. Balázs
{"title":"Convex Geometry of ReLU-layers, Injectivity on the Ball and Local Reconstruction","authors":"Daniel Haider, M. Ehler, P. Balázs","doi":"10.48550/arXiv.2307.09672","DOIUrl":"https://doi.org/10.48550/arXiv.2307.09672","url":null,"abstract":"The paper uses a frame-theoretic setting to study the injectivity of a ReLU-layer on the closed ball of $mathbb{R}^n$ and its non-negative part. In particular, the interplay between the radius of the ball and the bias vector is emphasized. Together with a perspective from convex geometry, this leads to a computationally feasible method of verifying the injectivity of a ReLU-layer under reasonable restrictions in terms of an upper bound of the bias vector. Explicit reconstruction formulas are provided, inspired by the duality concept from frame theory. All this gives rise to the possibility of quantifying the invertibility of a ReLU-layer and a concrete reconstruction algorithm for any input vector on the ball.","PeriodicalId":74529,"journal":{"name":"Proceedings of the ... International Conference on Machine Learning. International Conference on Machine Learning","volume":"24 1","pages":"12339-12350"},"PeriodicalIF":0.0,"publicationDate":"2023-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83476070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
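A certificate of injectivity is the hard direction the paper addresses; the cheap direction, falsification, fits in a short script: sample pairs on the sphere and look for output collisions of x -> ReLU(Wx + b). The Monte-Carlo check below can only disprove injectivity (a pass is inconclusive), unlike the paper's frame-theoretic verification under a bias upper bound; the example weights and trial count are assumptions.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def mc_injectivity_check(W, b, radius=1.0, trials=20000, seed=0):
    """Naive Monte-Carlo falsification of injectivity of x -> ReLU(Wx + b)
    on the ball of given radius: sample point pairs and look for collisions."""
    rng = np.random.default_rng(seed)
    n = W.shape[1]
    for _ in range(trials):
        x1, x2 = rng.normal(size=(2, n))
        x1 *= radius / np.linalg.norm(x1)   # points on the sphere of that radius
        x2 *= radius / np.linalg.norm(x2)
        if np.allclose(relu(W @ x1 + b), relu(W @ x2 + b)) and not np.allclose(x1, x2):
            return False                    # collision found: not injective
    return True                             # no collision found (inconclusive)

W = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]])
print(mc_injectivity_check(W, b=np.zeros(3)))        # likely True (no collision found)
print(mc_injectivity_check(W, b=-10 * np.ones(3)))   # False: bias kills all units on the ball
```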