Easing the Reuse of ML Solutions by Interactive Clustering-based Autotuning in Scientific Applications

2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA) Pub Date : 2022-08-01 DOI:10.1109/SEAA56994.2022.00011

H. Hajiabadi, Lennart Hilbert, A. Koziolek

{"title":"Easing the Reuse of ML Solutions by Interactive Clustering-based Autotuning in Scientific Applications","authors":"H. Hajiabadi, Lennart Hilbert, A. Koziolek","doi":"10.1109/SEAA56994.2022.00011","DOIUrl":null,"url":null,"abstract":"Machine learning techniques have revolutionised scientific software projects. Scientists are continuously looking for novel approaches to production-quality reuse of machine learning solutions and to make them available to other components of the project with satisfactory quality and low costs. However, scientists often have limited knowledge about how to effectively reuse and adjust machine learning solutions in their particular scientific project. One challenge is that many machine learning solutions require parameter tuning based on the input data to achieve satisfactory results, which is difficult and cumbersome for users not familiar with machine learning. Autotuning is the common technique for potentially adjusting the parameters based on the data, but it requires a well-defined objective function to optimize for. Such an objective function is commonly unknown in exploratory scientific research such as biological image segmentation tasks. In this paper, we propose a framework based on the novel combination of autotuning and active learning to ease and partially automate the reuse effort of machine learning solutions for scientists in biological image segmentation cases. Underlying this combination is a mapping between an object type and specific parameters applied during the segmentation process. This mapping is iteratively adjusted by asking users for visual feedback. We then through a biological case study demonstrate that our method enables tuning of the segmentation specifically to object types, while the selective requests of user input reduce the number of user interactions required for this task.","PeriodicalId":269970,"journal":{"name":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","volume":"515 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SEAA56994.2022.00011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Machine learning techniques have revolutionised scientific software projects. Scientists are continuously looking for novel approaches to production-quality reuse of machine learning solutions and to make them available to other components of the project with satisfactory quality and low costs. However, scientists often have limited knowledge about how to effectively reuse and adjust machine learning solutions in their particular scientific project. One challenge is that many machine learning solutions require parameter tuning based on the input data to achieve satisfactory results, which is difficult and cumbersome for users not familiar with machine learning. Autotuning is the common technique for potentially adjusting the parameters based on the data, but it requires a well-defined objective function to optimize for. Such an objective function is commonly unknown in exploratory scientific research such as biological image segmentation tasks. In this paper, we propose a framework based on the novel combination of autotuning and active learning to ease and partially automate the reuse effort of machine learning solutions for scientists in biological image segmentation cases. Underlying this combination is a mapping between an object type and specific parameters applied during the segmentation process. This mapping is iteratively adjusted by asking users for visual feedback. We then through a biological case study demonstrate that our method enables tuning of the segmentation specifically to object types, while the selective requests of user input reduce the number of user interactions required for this task.

查看原文本刊更多论文

在科学应用中通过基于交互式聚类的自动调优简化机器学习解决方案的重用

机器学习技术已经彻底改变了科学软件项目。科学家们一直在寻找新的方法来实现机器学习解决方案的生产质量重用，并使它们能够以令人满意的质量和低成本提供给项目的其他组件。然而，科学家们通常对如何在特定的科学项目中有效地重用和调整机器学习解决方案知之甚少。一个挑战是，许多机器学习解决方案需要根据输入数据进行参数调优才能达到满意的结果，这对于不熟悉机器学习的用户来说是困难和繁琐的。自动调优是基于数据调整参数的常用技术，但它需要一个定义良好的目标函数来进行优化。这样的目标函数在探索性科学研究如生物图像分割任务中通常是未知的。在本文中，我们提出了一个基于自动调整和主动学习的新组合的框架，以减轻和部分自动化科学家在生物图像分割案例中的机器学习解决方案的重用工作。这种组合的基础是对象类型和在分割过程中应用的特定参数之间的映射。这种映射是通过要求用户提供视觉反馈来迭代调整的。然后，我们通过一个生物学案例研究证明，我们的方法可以根据对象类型调整分割，而用户输入的选择性请求减少了该任务所需的用户交互次数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)

自引率

0.00%

发文量