{"title":"Lessons from Industry for Science Cyberinfrastructure: Simplicity, Scale, and Sustainability via SaaS/PaaS","authors":"Ian T Foster","doi":"10.1145/2753524.2753525","DOIUrl":"https://doi.org/10.1145/2753524.2753525","url":null,"abstract":"Commercial information technology has changed dramatically over the past decade, with profound consequences for both software developers and software consumers. Software-as-a-service (SaaS) enables remote use of powerful capabilities, from accounting and payroll to weather alerts and transportation logistics, that used to require expensive in-house facilities and expertise. Platform-as-a-service (PaaS) offerings from cloud providers simplify the development and operation of SaaS software. These developments have slashed costs, reduced barriers to access and entry, and spurred innovation. Science cyberinfrastructure, in contrast, seems stuck in the 20th Century. I discuss lessons from industry that the scientific community might consider when designing cyberinfrastructure for the 21st Century.","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125003060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Achieving Formal Parallel Program Debugging by Incentivizing CS/HPC Collaborative Tool Development","authors":"G. Gopalakrishnan, G. Sawaya","doi":"10.1145/2753524.2753531","DOIUrl":"https://doi.org/10.1145/2753524.2753531","url":null,"abstract":"Many disruptive changes are happening in the arena of parallel computing, including the use of multiple compute element types (CPUs and GPUs), memory and interconnect types, as well as multiple concurrency models. In the face of these changes, cybersoftware development and debugging will become increasingly hard, requiring principled (formal) debugging approaches that are focused on anticipated classes of bugs. Academic research groups must play a central role in the creation of formal debugging solutions that are to become part of the nation's cyberinfrastructure. These solutions must be scalable and prove effective for debugging the parallel and distributed systems being planned for the coming decade. However, at present, there isn't a critical mass of academic groups engaged in designing such debugging solutions for HPC. In this paper, we summarize some of our own experiences in creating verification tools based on formal principles, pointing out the few instances in which we managed to impact practice. Our key observation is that our successes went hand-in-hand with collaborative tool development with practitioners. Seeking creative ways to facilitate such collaborations should therefore be high priority in our cyberinfrastructure roadmap.","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"108 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115069886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Session 1","authors":"Daniel S. Katz","doi":"10.1145/3261035","DOIUrl":"https://doi.org/10.1145/3261035","url":null,"abstract":"","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122990334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Authentication and Authorization Considerations for a Multi-tenant Service","authors":"R. Heiland, S. Koranda, S. Marru, M. Pierce, Von Welch","doi":"10.1145/2753524.2753534","DOIUrl":"https://doi.org/10.1145/2753524.2753534","url":null,"abstract":"Distributed cyberinfrastructure requires users (and machines) to perform some sort of authentication and authorization (together simply known as auth). In the early days of computing, authentication was performed with just a username and password combination, and this is still prevalent today. But during the past several years, we have seen an evolution of approaches and protocols for auth: Kerberos, SSH keys, X.509, OpenID, API keys, OAuth, and more. Not surprisingly, there are trade-offs, both technical and social, for each approach. The NSF Science Gateway communities have had to deal with a variety of auth issues. However, most of the early gateways were rather restrictive in their model of access and development. The practice of using community credentials (certificates), a well-intentioned idea to alleviate restrictive access, still posed a barrier to researchers and challenges for security and auditing. And while the web portal-based gateway clients offered users easy access from a browser, both the interface and the back-end functionality were constrained in the flexibility and extensibility they could provide. Designing a well-defined application programming interface (API) to fine-grained, generic gateway services (on secure, hosted cyberinfrastructure), together with an auth approach that has a lower barrier to entry, will hopefully present a more welcoming environment for both users and developers. This paper provides a review and some thoughts on these topics, with a focus on the role of auth between a Science Gateway and a service provider.","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123713508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Jetstream: A Distributed Cloud Infrastructure for Underresourced higher education communities","authors":"Jeremy Fischer, S. Tuecke, Ian T Foster, C. Stewart","doi":"10.1145/2753524.2753530","DOIUrl":"https://doi.org/10.1145/2753524.2753530","url":null,"abstract":"The US National Science Foundation (NSF) in 2015 awarded funding for a first-of-a-kind distributed cyberinfrastructure (DCI) system called Jetstream. Jetstream will be the NSF's first production cloud for general-purpose science and engineering research and education. Jetstream, scheduled for production in January 2016, will be based on the OpenStack cloud environment software with a menu-driven interface to make it easy for users to select a pre-composed Virtual Machine (VM) to perform a particular discipline-specific analysis. Jetstream will use the Atmosphere user interface developed as part of iPlant, providing a low barrier to use by practicing scientists, engineers, educators, and students, and Globus services from the University of Chicago for seamless integration into the national cyberinfrastructure fabric. The team implementing Jetstream has as their primary mission extending the reach of the NSF's eXtreme Digital (XD) program to researchers, educators, and research students who have not previously used NSF XD program resources, including those in communities and at institutions that traditionally lack significant cyberinfrastructure resources. We will, for example, use virtual Linux Desktops to deliver DCI capabilities supporting research and research education at small colleges and universities, including Historically Black Colleges and Universities (HBCUs), Minority Serving Institutions (MSIs), Tribal colleges, and higher education institutions in states designated by the NSF as eligible for funding via the Experimental Program to Stimulate Competitive Research (EPSCoR). Jetstream will be a novel distributed cyberinfrastructure, with production components in Indiana and Texas. In particular, Jetstream will deliver virtual Linux desktops to tablet devices and PDAs with reasonable responsiveness running over cellular networks. This paper will discuss design and application plans for Jetstream as a novel Distributed CyberInfrastructure system for research education.","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122134189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Session 3","authors":"S. Jha","doi":"10.1145/3261037","DOIUrl":"https://doi.org/10.1145/3261037","url":null,"abstract":"","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"454 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116156960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Apache Airavata as a Laboratory: Architecture and Case Study for Component-Based Gateway Middleware","authors":"S. Marru, M. Pierce, Sudhakar Pamidighantam, Chathuri Wimalasena","doi":"10.1145/2753524.2753529","DOIUrl":"https://doi.org/10.1145/2753524.2753529","url":null,"abstract":"Science gateways are more than user interfaces to computational grids and clouds. Gateways are middleware in their own right, providing flexible, lightweight federations of heterogeneous collections of computing resources (such as campus clusters, supercomputers, computational clouds), all of which remain challenges for many alternative middleware approaches. Gateways also are notable for providing science application-centric interfaces to computing resources rather than resource-centric views. An important challenge for science gateway research is to generalize specific science gateway strategies, defining a reference architecture that encompasses major gateway capabilities while enabling implementation flexibility. Such a reference architecture should also enable \"platform as a service\" approaches that provide hosted versions of common gateway capabilities. In this paper, we summarize the Apache Airavata software system as a candidate reference architecture for science gateways. We propose the use of a component-based architecture to encompass major gateway capabilities (such as metadata management, meta-scheduling, execution management, and messaging). We examine the messaging system component in this abstract architecture in detail and describe its re-implementation and validation using third party messaging system software to replace a custom-built messaging system. Besides the operational validation of this specific component, we infer a preliminary validation of the overall architecture. The flexibility of component implementations within an overall architecture is essential as it allows gateway middleware to be the subject of distributed computing research for its own sake while also ensuring that we don't get locked into less than optimal implementations for gateway operations.","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125833727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data Centric Discovery with a Data-Oriented Architecture","authors":"R. Schuler, C. Kesselman, K. Czajkowski","doi":"10.1145/2753524.2753532","DOIUrl":"https://doi.org/10.1145/2753524.2753532","url":null,"abstract":"Increasingly, scientific discovery is driven by the analysis, manipulation, organization, annotation, sharing, and reuse of high-value scientific data. While great attention has been given to the specifics of analyzing and mining data, we find that there are almost no tools nor systematic infrastructure to facilitate the process of discovery from data. We argue that a more systematic perspective is required, and in particular, propose a data-centric approach in which discovery stands on a foundation of data and data collections, rather than on fleeting transformations and operations. To address the challenges of data-centric discovery, we introduce a Data-Oriented Architecture and contrast it with the prevalent Service-Oriented Architecture. We describe an instance of the Data-Oriented Architecture and describe how it has been used in a variety of use cases.","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128639270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dynamic Provisioning of Data Intensive Computing Middleware Frameworks: A Case Study","authors":"Linh Ngo, Michael E. Payne, Flavio Villanustre, Richard Taylor, A. Apon","doi":"10.1145/2753524.2753528","DOIUrl":"https://doi.org/10.1145/2753524.2753528","url":null,"abstract":"Big data has become an important asset for industry, and academic disciplines now utilize large-scale data in their research. This fourth paradigm of scientific research has led to the inclusion of data management, processing, and analytic tools into the traditional high performance computing software libraries. This integration is facilitated through a collection of supporting software components that comprise a data intensive computing middleware framework. From a shared campus cyberinfrastructure perspective, this represents a new challenge to the system administrators in balancing between the traditional high performance computing software stacks and the new data-intensive middleware on the same physical computing resource. In turn, this limits researchers from having access to the new middleware tools while administrators determine how to overcome the challenge. In this paper, we present our experience in configuring dynamic provisioning of two different data-intensive middleware frameworks from a user perspective. We describe the configuration process from setting up dependencies to deploying the middleware, and how this experience can be applied by other researchers and administrators.","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124808761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Session 2","authors":"J. Weissman","doi":"10.1145/3261036","DOIUrl":"https://doi.org/10.1145/3261036","url":null,"abstract":"","PeriodicalId":321665,"journal":{"name":"Proceedings of the 1st Workshop on The Science of Cyberinfrastructure: Research, Experience, Applications and Models","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130705162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}