G. Pierantoni, D. Frost, K. Cassidy, S. Kenny, J. O'neill, P. Tiernan, E. Kilfeather
{"title":"The Digital Repository of Ireland","authors":"G. Pierantoni, D. Frost, K. Cassidy, S. Kenny, J. O'neill, P. Tiernan, E. Kilfeather","doi":"10.1109/IWSG.2015.17","DOIUrl":"https://doi.org/10.1109/IWSG.2015.17","url":null,"abstract":"The Digital Repository of Ireland is a trusted digital repository that contributes to a national digital infrastructure for the humanities and social sciences in Ireland. Although trusted digital repositories are not mainstream application in the domain of science gateways and the social sciences are not the usual domain for a science gateway we argue in this paper that DRI has many of the principal characteristics of a science gateway and that the domain of social science will benefit in the future from the same advantages that other more numerical-based disciplines reap from science gateways. In addition to offering tools to ingest vast amounts of heterogeneous data sets, a trusted digital repository must offer specific functionalities regarding persistence and trustworthiness of the stored data sets. Trusted digital repositories must maintain the information for a long period and this entails precise and at times difficult choices in the architecture and implementation details. Another facet in which DRI (and most TDRs) differ from mainstream science gateways is the relationship between data and metadata, in many science gateways, metadata is created from the automated analysis of data sets, while in DRI much of the metadata is manually defined by specialised users. Furthermore, as the data sets are heterogeneous in nature, the metadata used for their description is of crucial importance. In this paper we argue that TDRs such as DRI, will play an increasingly important role in the future of the social science community and that they constitute the modern equivalent of the great libraries of the past, places in which knowledge was kept and preserved to be available to researchers of the present and the future. Furthermore, although DRI is far from being the only Trusted Digital Repository available, we argue that certaing design decision such as the support of multiple metadata standards and its modular nature, make it an interesting example of a flexible TDR.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115347735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Gesing, Richard Grunzke, Jens Krüger, S. Herres‐Pawlis, A. Hoffmann
{"title":"Challenges and Modifications for Creating a MoSGrid Science Gateway for US and European Infrastructures","authors":"S. Gesing, Richard Grunzke, Jens Krüger, S. Herres‐Pawlis, A. Hoffmann","doi":"10.1109/IWSG.2015.10","DOIUrl":"https://doi.org/10.1109/IWSG.2015.10","url":null,"abstract":"The established German MoSGrid science gateway serves the computational chemistry community in the areas of quantum chemistry, molecular dynamics and docking with intuitive user interfaces for jobs, workflows and data as well as metadata management. It is designed independent of the underlying infrastructure and developed on top of gUSE/WS-PGRADE, which is capable of connecting to diverse grid, cloud and batch infrastructures. However, quite a few of its user-friendly features and pre-configured workflows are dependent on the service-oriented grid middleware UNICORE and on installed computational chemistry tools on the target resources. Furthermore, the security mechanisms rely on SAML assertions to provide single sign-on to the different available resources. This paper gives an overview on the challenges and modifications, which have been necessary to provide an international MoSGrid science gateway exploiting XSEDE and PRACE infrastructures. This work was part of the XSEDE-PRACE interoperability project, which selected three use cases out of US- and European-wide applications.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126838370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Alvaro Aguilera, Richard Grunzke, Ulf Markwardt, Dirk Schollbach, Dirk Habich, J. Garcke
{"title":"Towards an Industry Data Gateway: An Integrated Platform for the Analysis of Wind Turbine Data","authors":"Alvaro Aguilera, Richard Grunzke, Ulf Markwardt, Dirk Schollbach, Dirk Habich, J. Garcke","doi":"10.1109/IWSG.2015.8","DOIUrl":"https://doi.org/10.1109/IWSG.2015.8","url":null,"abstract":"The increasing amount of data produced in many scientific and engineering domains creates as many new challenges for an efficient data analysis, as possibilities for its application. In this paper, we present one of the use-cases of the project VAVID, namely the condition monitoring of sensor information from wind turbines, and how a data gateway can help to increase the usability and security of the proposed system. Starting by briefly introducing the project, the paper presents the problem of handling and processing large amount of sensor data using existing tools in the context of wind turbines. It goes on to describe the innovative approach used in VAVID to meet this challenge, covering the main goals, numerical methods used for analysis, the storage concept, and architectural design. It concludes by offering a rational for the use of a data gateway as the main entry point to the system and how this is being implemented in VAVID.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128079336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Extending Scientific Workflow Systems to Support MapReduce Based Applications in the Cloud","authors":"Shashank Gugnani, T. Kiss","doi":"10.1109/IWSG.2015.15","DOIUrl":"https://doi.org/10.1109/IWSG.2015.15","url":null,"abstract":"Cloud Computing has gained a lot of popularity in recent years because of the flexibility that it offers. In addition, there seems to be a rising interest in combining Parallel Computing, Cloud Computing and Big Data to create large scale scientific applications. WS-PGRADE is a gateway framework that allows users to create such applications by defining them as scientific workflows. This paper investigates how workflow systems and science gateways, such as WS-PGRADE, can be extended with data processing capabilities of Hadoop based on the MapReduce paradigm in the cloud. Analysis shows the methods described to integrate Hadoop with workflows and science gateways work well in different scenarios and can be used to create massively parallel applications for scientific analysis of Big Data.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"21 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132270752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Connecting Workflow-Oriented Science Gateways to Multi-cloud Systems","authors":"Z. Farkas, P. Kacsuk, Á. Hajnal","doi":"10.1109/IWSG.2015.20","DOIUrl":"https://doi.org/10.1109/IWSG.2015.20","url":null,"abstract":"In this paper we investigate solutions to cloud-enable workflow-oriented science gateways. The integration mechanism described in the paper is a generic method that can be followed by other gateway developers. The paper describes the principles and the concrete ways how to integrate science gateways with multi-cloud systems. The concrete example to demonstrate the integration principles builds on the integration of WS-PGRADE/gUSE and the Cloud Broker Platform (CBP). The integration of WS-PGRADE/gUSE and Cloud Broker offers a complete cloud-enabled science gateway platform for a diverse set of use-cases and user communities, with the availability to use mainstream cloud middleware types and services (Amazon EC2/S3, IBM, Open Stack, Open Nebula, Rados S3). The advantage of the integrated WS-PGRADE/gUSE/CBP system is that if a domain-specific science gateway is customized from WS-PGRADE/gUSE gateway framework it immediately inherits this cloud access flexibility, i.e. The user community of that gateway can access all the cloud types enabled by the integrated WS-PGRADE/gUSE/CBP system.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114518904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Arshad, G. Terstyánszky, T. Kiss, N. Weingarten
{"title":"A Definition and Analysis of the Role of Meta-workflows in Workflow Interoperability","authors":"J. Arshad, G. Terstyánszky, T. Kiss, N. Weingarten","doi":"10.1109/IWSG.2015.18","DOIUrl":"https://doi.org/10.1109/IWSG.2015.18","url":null,"abstract":"Scientific workflows orchestrate the execution of complex experiments on high performance computing platforms. Meta-workflows represent an emerging type of such workflows which aim to integrate multiple embedded workflows from potentially different workflow systems to achieve complex experimentation. Workflow interoperability plays a profound role in achieving this objective. This paper is focused at formalizing definitions of different types of workflows and meta-workflows to facilitate improved understanding and interoperability. It also includes thorough formalization of the coarse grained workflow interoperability approach highlighting the role of workflow systems. The paper presents a case study from Heliophysics which successfully demonstrates the use of technologies developed to realize the concepts of meta-workflows and workflow interoperability within a science gateway environment.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123070915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Brokering Solution for Science Gateways Using Multiple Distributed Computing Infrastructures","authors":"K. Karóczkai, A. Kertész, P. Kacsuk","doi":"10.1109/IWSG.2015.12","DOIUrl":"https://doi.org/10.1109/IWSG.2015.12","url":null,"abstract":"The ever growing number of computation-intensive applications calls for the interoperation of distributed infrastructures such as Clouds, Grids and private clusters. The European SHIWA and ER-flow projects have been initiated to enable the combination of heterogeneous scientific workflows, and to execute them in a large-scale system consisting of multiple Distributed Computing Infrastructures including Grids and Clouds. One of the resource management challenges of these projects called parameter study job scheduling. A parameter study job of a workflow has a large number of input files to be consumed by independent job instances. In this paper we propose a meta-brokering solution for these workflows to be applied in science gateways. To cope with the high uncertainty and unpredictable load of the utilized infrastructures, we rely on resource priority services. These tools are capable of determining and dynamically updating priorities of the available infrastructures to be selected for job instances.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"79 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131189554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Herres‐Pawlis, A. Hoffmann, T. Rosener, Richard Grunzke, Jens Krüger, S. Gesing
{"title":"Multi-layer Meta-metaworkflows for the Evaluation of Solvent and Dispersion Effects in Transition Metal Systems Using the MoSGrid Science Gateways","authors":"S. Herres‐Pawlis, A. Hoffmann, T. Rosener, Richard Grunzke, Jens Krüger, S. Gesing","doi":"10.1109/IWSG.2015.13","DOIUrl":"https://doi.org/10.1109/IWSG.2015.13","url":null,"abstract":"Through the MoSGrid (Molecular Simulation Grid) Science Gateways, users have easy and highly efficient access to computational resources. Within the quantum chemical domain, we have developed meta-meta workflows for quantum chemical applications which contain re-usable meta workflows. Here, we present a new use case which demonstrates that the multi-layer idea can enhance the concept of meta workflows. Workflows are implemented by using the WS-PGRADE technology. This approach allows researchers to generate their jobs more easily. Moreover, every abstraction layer adds a factor of 2-3 in efficiency as a lot of manual job definition time can be saved.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127449727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"TraceRep: Gateway for Sharing and Collecting Traces in HPC Systems","authors":"Iván Pérez, E. Vallejo, J. L. Bosque","doi":"10.1109/IWSG.2015.11","DOIUrl":"https://doi.org/10.1109/IWSG.2015.11","url":null,"abstract":"Traces of parallel programs are a valuable resource for users of HPC systems because they provide insight about the efficiency of the execution of their applications, allowing to improve the code. Additionally, for computer scientists they are useful as an input to simulation tools to guide the development of high performance computing (HPC) systems. Unfortunately, the limited access to these systems, the knowledge required to use the trace extraction tools, and the huge volume that traces can reach, makes the collection and sharing of traces hard. Trace Rep is a computer science gateway designed in the University of Cantabria designed to solve this problems. Its web interface and automated cluster access hides the complexity of collecting traces, and its public repository section simplifies sharing traces, including mechanisms to licence the data and guarantee authorship. TraceRep is available at: http://tracerep.unican.es.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133532868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Processing Manager for Science Gateways","authors":"M. Jaghoori, Shayan Shahand, S. Olabarriaga","doi":"10.1109/IWSG.2015.9","DOIUrl":"https://doi.org/10.1109/IWSG.2015.9","url":null,"abstract":"In this paper, we define the concept of Processing in the context of science gateways (SGs), and the necessary management layer that is needed to handle it. A Processing captures the execution/run of a scientific Application, the Data consumed and produced during that run, and the User that performs it. The properties that define Application, Data and User are different across various layers, from the scientific domain to the infrastructure. We describe a software layer, the Processing Manager (PM), that manages the translation of a Processing from the domain level to the infrastructure level and back. The PM offers a homogeneous abstraction layer over heterogeneous execution platforms and data services for various infrastructures, thus simplifying the development, maintenance and operation of SGs. Finally, the PM enables external high-level control and monitoring of the execution process and naturally captures the provenance of each processing.","PeriodicalId":341012,"journal":{"name":"2015 7th International Workshop on Science Gateways","volume":"133 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122129909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}