{"title":"Coordination of data movement with computation scheduling on a cluster","authors":"John Bent, D. Rotem, A. Romosan, A. Shoshani","doi":"10.1109/CLADE.2005.1520896","DOIUrl":"https://doi.org/10.1109/CLADE.2005.1520896","url":null,"abstract":"We are looking at the problem of scheduling compute tasks on a cluster of servers. These tasks require files that reside on a remote archive, and may also be cached on some subset of the servers. A task can only be run on a server that has the files it requires. This introduces the problem of scheduling data movement in coordination with the scheduling of computation. Our goal is to maximize throughput while minimizing data movement. FIFO scheduling is not efficient in this situation due to its lack of awareness of the data movement required. We looked at two other strategies, called shortest job first and linear programming based optimization, and compared them under various configurations.","PeriodicalId":330715,"journal":{"name":"CLADE 2005. Proceedings Challenges of Large Applications in Distributed Environments, 2005.","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134280589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"FSML: Fusion Simulation Markup Language for interoperability of data and analysis tools","authors":"S. Shasharina, Chuang Li","doi":"10.1109/CLADE.2005.1520913","DOIUrl":"https://doi.org/10.1109/CLADE.2005.1520913","url":null,"abstract":"As the fusion community becomes more interconnected and problems become more complex, very close collaborative efforts are expected. This requires internetworking various codes, comparing solutions from multiple solvers, and sharing of data and data analysis tools. However, the data formats and data analysis tools used in fusion and plasma simulations are highly heterogeneous. Imposing one standard data format and one type of tool is unrealistic due to historical and practical reasons. In this paper, we propose to create the Fusion Simulation Markup Language or FSML - an XML based system for describing and accessing fusion and plasma physics simulation data of various formats used in the community. The system consists of syntactic and semantic metadata organized in specialized XML schemas and APIs written for accessing data from major data analysis and visualization tools. We present the preliminary results in formulating the FSML schema and APIs in AVS/Express modules, and demonstrate their application for two large three-dimensional fusion simulation codes, M3D and NIMROD. The results show that the FSML schema and the set of tools developed provide a strong initial momentum and technology for the community effort to enhance data exchange and interoperability of analysis tools.","PeriodicalId":330715,"journal":{"name":"CLADE 2005. Proceedings Challenges of Large Applications in Distributed Environments, 2005.","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127145868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Searching against distributed data using a Web service architecture","authors":"Thomas W. Jackson, M. Jessop, A. Pasley, J. Austin","doi":"10.1109/CLADE.2005.1520914","DOIUrl":"https://doi.org/10.1109/CLADE.2005.1520914","url":null,"abstract":"Many condition health-monitoring applications require access to distributed data assets. The DAME project has investigated one such example based upon condition monitoring of civil aero-engine sensor data. A service-based solution is introduced that has been implemented within the Globus grid framework. It provides a general architecture for distributed search and identifies the generic functionality that is required.","PeriodicalId":330715,"journal":{"name":"CLADE 2005. Proceedings Challenges of Large Applications in Distributed Environments, 2005.","volume":"128 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121338904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}