Navigating the Maze: The Impact of Configurability in Bioinformatics Software

2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE) Pub Date : 2018-09-01 DOI:10.1145/3238147.3240466

Mikaela Cashman, Myra B. Cohen, P. Ranjan, R. Cottingham

{"title":"Navigating the Maze: The Impact of Configurability in Bioinformatics Software","authors":"Mikaela Cashman, Myra B. Cohen, P. Ranjan, R. Cottingham","doi":"10.1145/3238147.3240466","DOIUrl":null,"url":null,"abstract":"The bioinformatics software domain contains thousands of applications for automating tasks such as the pairwise alignment of DNA sequences, building and reasoning about metabolic models or simulating growth of an organism. Its end users range from sophisticated developers to those with little computational experience. In response to their needs, developers provide many options to customize the way their algorithms are tuned. Yet there is little or no automated help for the user in determining the consequences or impact of the options they choose. In this paper we describe our experience working with configurable bioinformatics tools. We find limited documentation and help for combining and selecting options along with variation in both functionality and performance. We also find previously undetected faults. We summarize our findings with a set of lessons learned, and present a roadmap for creating automated techniques to interact with bioinformatics software. We believe these will generalize to other types of scientific software.","PeriodicalId":6622,"journal":{"name":"2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE)","volume":"32 1","pages":"757-767"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3238147.3240466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 23

Abstract

The bioinformatics software domain contains thousands of applications for automating tasks such as the pairwise alignment of DNA sequences, building and reasoning about metabolic models or simulating growth of an organism. Its end users range from sophisticated developers to those with little computational experience. In response to their needs, developers provide many options to customize the way their algorithms are tuned. Yet there is little or no automated help for the user in determining the consequences or impact of the options they choose. In this paper we describe our experience working with configurable bioinformatics tools. We find limited documentation and help for combining and selecting options along with variation in both functionality and performance. We also find previously undetected faults. We summarize our findings with a set of lessons learned, and present a roadmap for creating automated techniques to interact with bioinformatics software. We believe these will generalize to other types of scientific software.

查看原文本刊更多论文

导航迷宫:可配置性在生物信息学软件中的影响

生物信息学软件领域包含数千种自动化任务的应用程序，例如DNA序列的成对比对、代谢模型的构建和推理或模拟生物体的生长。它的最终用户范围从老练的开发人员到缺乏计算经验的人。为了满足他们的需求，开发人员提供了许多选项来定制他们的算法调优方式。然而，在确定用户选择的选项的后果或影响方面，几乎没有或根本没有自动帮助。在本文中，我们描述了我们使用可配置生物信息学工具的经验。对于组合和选择选项以及功能和性能的变化，我们找到了有限的文档和帮助。我们还发现了以前未检测到的故障。我们总结了我们的发现和一组经验教训，并提出了创建与生物信息学软件交互的自动化技术的路线图。我们相信这些将推广到其他类型的科学软件。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE)

自引率

0.00%

发文量