Errikos Streviniotis , Nikos Giatrakos , Yannis Kotidis , Thaleia Ntiniakou , Miguel Ponce de Leon
{"title":"RATS:一个资源分配器,用于在高性能计算基础设施上优化肿瘤模拟的执行","authors":"Errikos Streviniotis , Nikos Giatrakos , Yannis Kotidis , Thaleia Ntiniakou , Miguel Ponce de Leon","doi":"10.1016/j.is.2025.102538","DOIUrl":null,"url":null,"abstract":"<div><div>In this work, we introduce RATS (<u>R</u>esource <u>A</u>llocator for <u>T</u>umor <u>S</u>imulations), the first optimizer for the execution of tumor simulations over HPC infrastructures. Given a set of drug therapies under in-silico study, the optimization framework of RATS can: <em>(i)</em> devise the optimal number of cores and prescribe the required number of core hours; and <em>(ii)</em> under core capacity constraints, RATS schedules the execution of simulations minimizing the overall number of core hours, simultaneously prioritizing the execution of expectedly promising in-silico trials higher compared to unpromising ones. RATS is deployed by life scientists at the Barcelona Supercomputing Center to remove the burden of blindly guessing the core hours needing to be reserved from HPC admins to study various tumor treatment methodologies, as well as to rapidly distinguish effective drug combinations, thus, potentially cutting time to market for new cancer therapies. The latter is further elevated by the RATS+ extension we plug into the initial framework. RATS+ employs a Transfer Learning approach to leverage optimization models and decisions from prior in-silico studies, thereby reducing the optimization effort required for new studies in this domain.</div><div>Our experimental evaluation, on real-world data derived from the execution of more than 2500 tumor simulations on the MareNostrum4 supercomputer, confirms the effectiveness of both RATS and RATS+ across the aforementioned performance dimensions.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"132 ","pages":"Article 102538"},"PeriodicalIF":3.0000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RATS: A resource allocator for optimizing the execution of tumor simulations over HPC infrastructures\",\"authors\":\"Errikos Streviniotis , Nikos Giatrakos , Yannis Kotidis , Thaleia Ntiniakou , Miguel Ponce de Leon\",\"doi\":\"10.1016/j.is.2025.102538\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this work, we introduce RATS (<u>R</u>esource <u>A</u>llocator for <u>T</u>umor <u>S</u>imulations), the first optimizer for the execution of tumor simulations over HPC infrastructures. Given a set of drug therapies under in-silico study, the optimization framework of RATS can: <em>(i)</em> devise the optimal number of cores and prescribe the required number of core hours; and <em>(ii)</em> under core capacity constraints, RATS schedules the execution of simulations minimizing the overall number of core hours, simultaneously prioritizing the execution of expectedly promising in-silico trials higher compared to unpromising ones. RATS is deployed by life scientists at the Barcelona Supercomputing Center to remove the burden of blindly guessing the core hours needing to be reserved from HPC admins to study various tumor treatment methodologies, as well as to rapidly distinguish effective drug combinations, thus, potentially cutting time to market for new cancer therapies. The latter is further elevated by the RATS+ extension we plug into the initial framework. RATS+ employs a Transfer Learning approach to leverage optimization models and decisions from prior in-silico studies, thereby reducing the optimization effort required for new studies in this domain.</div><div>Our experimental evaluation, on real-world data derived from the execution of more than 2500 tumor simulations on the MareNostrum4 supercomputer, confirms the effectiveness of both RATS and RATS+ across the aforementioned performance dimensions.</div></div>\",\"PeriodicalId\":50363,\"journal\":{\"name\":\"Information Systems\",\"volume\":\"132 \",\"pages\":\"Article 102538\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306437925000237\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306437925000237","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
在这项工作中,我们介绍了RATS (Resource Allocator for Tumor Simulations),这是在高性能计算基础设施上执行肿瘤模拟的第一个优化器。给定一组正在进行计算机化研究的药物疗法,rat优化框架可以:(i)设计出最优的核心数,并规定所需的核心小时数;(ii)在核心容量限制下,RATS调度模拟的执行,使核心小时总数最小化,同时优先执行预期有希望的计算机试验,而不是没有希望的试验。巴塞罗那超级计算中心的生命科学家部署了RATS,以消除盲目猜测HPC管理员需要保留多少核心时间来研究各种肿瘤治疗方法的负担,并快速区分有效的药物组合,从而可能缩短新癌症疗法的上市时间。我们插入到初始框架中的RATS+扩展进一步提升了后者。RATS+采用迁移学习方法来利用先前计算机研究中的优化模型和决策,从而减少了该领域新研究所需的优化工作。我们的实验评估来自MareNostrum4超级计算机上执行的2500多个肿瘤模拟的真实数据,证实了RATS和RATS+在上述性能维度上的有效性。
RATS: A resource allocator for optimizing the execution of tumor simulations over HPC infrastructures
In this work, we introduce RATS (Resource Allocator for Tumor Simulations), the first optimizer for the execution of tumor simulations over HPC infrastructures. Given a set of drug therapies under in-silico study, the optimization framework of RATS can: (i) devise the optimal number of cores and prescribe the required number of core hours; and (ii) under core capacity constraints, RATS schedules the execution of simulations minimizing the overall number of core hours, simultaneously prioritizing the execution of expectedly promising in-silico trials higher compared to unpromising ones. RATS is deployed by life scientists at the Barcelona Supercomputing Center to remove the burden of blindly guessing the core hours needing to be reserved from HPC admins to study various tumor treatment methodologies, as well as to rapidly distinguish effective drug combinations, thus, potentially cutting time to market for new cancer therapies. The latter is further elevated by the RATS+ extension we plug into the initial framework. RATS+ employs a Transfer Learning approach to leverage optimization models and decisions from prior in-silico studies, thereby reducing the optimization effort required for new studies in this domain.
Our experimental evaluation, on real-world data derived from the execution of more than 2500 tumor simulations on the MareNostrum4 supercomputer, confirms the effectiveness of both RATS and RATS+ across the aforementioned performance dimensions.
期刊介绍:
Information systems are the software and hardware systems that support data-intensive applications. The journal Information Systems publishes articles concerning the design and implementation of languages, data models, process models, algorithms, software and hardware for information systems.
Subject areas include data management issues as presented in the principal international database conferences (e.g., ACM SIGMOD/PODS, VLDB, ICDE and ICDT/EDBT) as well as data-related issues from the fields of data mining/machine learning, information retrieval coordinated with structured data, internet and cloud data management, business process management, web semantics, visual and audio information systems, scientific computing, and data science. Implementation papers having to do with massively parallel data management, fault tolerance in practice, and special purpose hardware for data-intensive systems are also welcome. Manuscripts from application domains, such as urban informatics, social and natural science, and Internet of Things, are also welcome. All papers should highlight innovative solutions to data management problems such as new data models, performance enhancements, and show how those innovations contribute to the goals of the application.