C. Yeh, Fengqiang Li, G. Pastorelli, M. Walton, A. Katsaggelos, O. Cossairt
{"title":"Shape-from-Shifting: Uncalibrated Photometric Stereo with a Mobile Device","authors":"C. Yeh, Fengqiang Li, G. Pastorelli, M. Walton, A. Katsaggelos, O. Cossairt","doi":"10.1109/eScience.2017.89","DOIUrl":"https://doi.org/10.1109/eScience.2017.89","url":null,"abstract":"Surface shape scanning techniques, such as laser scanning and photometric stereo, are widespread analytical tools used in the field of cultural heritage. Compared to regular 2D RGB photos, 3D surface scans provide higher fidelity of an object's surface shape which assist conservators, art historians, and archaeologists in understanding how these artworks and artifacts are made and to digitally document them for purposes of conservation. However, current state-of-the-art 3D surface scanning tools used in art conservation are often expensive and bulky-such as light dome structures that are often over 1 m in diameter. In this paper, we introduce mobile shape-from-shifting (SfS): a simple, low-cost and streamlined photometric stereo framework for scanning planar surfaces with a consumer mobile device coupled to a low-cost add-on component. Our free-form mobile SfS framework relaxes the rigorous hardware and other complex requirements inherent to conventional 3D scanning tools. This is achieved by taking a sequence of photos with the on-board camera and flash of a mobile device. The sequence of captures are used to reconstruct high quality normal maps using nearlight photometric stereo algorithms, which are of comparable quality to conventional photometric stereo. We demonstrate 3D surface reconstructions with SfS on different materials and scales. Moreover, the mobile SfS technique can be used \"in the wild\" so that 3D scans may be performed in their natural environment, eliminating the need for transport to a laboratory setting. With the elegant design and low cost, we believe our Mobile SfS can greatly benefit the conservation community by providing a userfriendly and cost-effective solution for 3D surface scanning.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"279 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121037130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Raúl Palma, M. Krystek, José Manuél Gómez-Pérez, Andres Garcia-Silva, Sergio Ferraresi, S. Mantovani, S. Avolio, Sergio de Gioia
{"title":"Supporting Research and Operational Earth Science Portals through ROHub","authors":"Raúl Palma, M. Krystek, José Manuél Gómez-Pérez, Andres Garcia-Silva, Sergio Ferraresi, S. Mantovani, S. Avolio, Sergio de Gioia","doi":"10.1109/eScience.2017.71","DOIUrl":"https://doi.org/10.1109/eScience.2017.71","url":null,"abstract":"Earth science disciplines are increasingly producing large amounts of heterogeneous data that needs to be accessed, integrated and processed during the course of a research or operational process. In this setting, there is a growing need from earth scientists to collect and manage effectively these resources, including the data used, the methods applied, and the results produced. This is even more challenging if we consider i) the dynamic and collaborative nature of the earth science processes; ii) the interfaces typically used by scientists where they should be able to accomplish core management tasks with a minimal overhead. In this poster, we present our approach to address these challenges, which is based on research objects and on the associated technological support provided by ROHub platform. We show how ROHub is used as the underlying technology for supporting earth scientists work through different interfaces tailored to their communities needs and usage scenario.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124862099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Roselyne B. Tchoua, K. Chard, Debra J. Audus, Logan T. Ward, Joshua Lequieu, Juan J. de Pablo, Ian T Foster
{"title":"Towards a Hybrid Human-Computer Scientific Information Extraction Pipeline","authors":"Roselyne B. Tchoua, K. Chard, Debra J. Audus, Logan T. Ward, Joshua Lequieu, Juan J. de Pablo, Ian T Foster","doi":"10.1109/eScience.2017.23","DOIUrl":"https://doi.org/10.1109/eScience.2017.23","url":null,"abstract":"The emerging field of materials informatics has the potential to greatly reduce time-to-market and development costs for new materials. The success of such efforts hinges on access to large, high-quality databases of material properties. However, many such data are only to be found encoded in text within esoteric scientific articles, a situation that makes automated extraction difficult and manual extraction time-consuming and error-prone. To address this challenge, we present a hybrid Information Extraction (IE) pipeline to improve the machine-human partnership with respect to extraction quality and person-hours, through a combination of rule-based, machine learning, and crowdsourcing approaches. Our goal is to leverage computer and human strengths to alleviate the burden on human curators by automating initial extraction tasks before prioritizing and assigning specialized curation tasks to humans with different levels of training: using non-experts for straightforward tasks such as validation of higher accuracy results (e.g., completing partial facts) and domain experts for low-certainty results (e.g., reviewing specialized compound labels). To validate our approaches, we focus on the task of extracting the glass transition temperature of polymers from published articles. Applying our approaches to 6 090 articles, we have so far extracted 259 refined data values. We project that this number will grow considerably as we tune our methods and process more articles, to exceed that found in standard, expert-curated polymer data handbooks while also being easier to keep up-to-date. The freely available data can be found on our Polymer Properties Predictor and Database website at http://pppdb.uchicago.edu.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"341 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123314139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
P. Gagnier, H. Maschner, Aureliane Gailliegue, Loic Norgeot, C. Dapogny, L. Revéret, A. Abourachid
{"title":"Automatic Reconstruction of Polygon Triangulation for Mounted Skeleton Point Cloud","authors":"P. Gagnier, H. Maschner, Aureliane Gailliegue, Loic Norgeot, C. Dapogny, L. Revéret, A. Abourachid","doi":"10.1109/eScience.2017.86","DOIUrl":"https://doi.org/10.1109/eScience.2017.86","url":null,"abstract":"In the collections of natural history, mounted skeletons are among the most complex objects. They are composed of hundreds of different bones, tedious to digitize accurately in 3D because many surfaces remain hidden to the scanning device. A group of researchers from Pierre et Marie Curie (Paris 6) and Grenoble Universities teamed up with researchers from the National Museum of Natural History in Paris in order to design and implement a mathematical model of the bone surface deformation through optimization. The goal is to produce a surface triangulation adapted to the underlying surface intrinsic geometric properties from the sole datum of point clouds of skeleton bones. Outlier points will be removed from the data and the remaining inlier points will be labeled according to their membership to a specific bone structure. The results will be validated from the anatomical point of view and will be used to conduct functional morphology analysis. In our approach, each bone reconstruction of a skeleton will be obtained by morphing a generic representative surface of the same equivalence class using a mathematical derivative-based model. The resolution of this problem will lead to a definition of a closed orientable surface and will allow to account for the conservation of specifically labeled components.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121987749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Sen, N. Rao, Qiang Liu, N. Imam, R. Kettimuthu, Ian T Foster
{"title":"On Analytics of File Transfer Rates over Dedicated Wide-Area Connections","authors":"S. Sen, N. Rao, Qiang Liu, N. Imam, R. Kettimuthu, Ian T Foster","doi":"10.1109/eScience.2017.93","DOIUrl":"https://doi.org/10.1109/eScience.2017.93","url":null,"abstract":"File transfers between the decentralized storage sites over dedicated wide-area connections are becoming increasingly important in high-performance computing and big data scenarios. Designing such scientific workflows for large file transfers is extremely challenging as they depend on the file, I/O, host, and local- and wide-area network subsystems, and their interactions. To gain insights into file-transfer rate profiles, we develop polynomial, bagging, and boosting regression models for Lustre and XFS file transfer measurements, which are collected using XDD over a suite of 10 Gbps connections with 0-366 ms round trip times (RTTs). In addition to overall trends and analytics, these regressions also provide file-transfer rate estimates for RTTs and number of parallel flows at which measurements might not have been collected. They show that bagging and boosting techniques provide closer data fits than the polynomial regression. We develop probabilistic bounds on the generalization error of these methods, which combined with the cross-validation error establish that former two are more accurate estimators than the polynomial regression. In addition, we present a method to efficiently determine the number of parallel flows to achieve a peak file-transfer rate using fewer than full sweep measurements; in our measurements, the peak is achieved in 96% of cases with 15-25% of measurements of a full sweep.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128507612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Learning on Operational Facility Data Related to Large-Scale Distributed Area Scientific Workflows","authors":"Alok Singh, E. Stephan, M. Schram, I. Altintas","doi":"10.1109/eScience.2017.94","DOIUrl":"https://doi.org/10.1109/eScience.2017.94","url":null,"abstract":"Distributed computing platforms provide a robust mechanism to perform large-scale computations by splitting the task and data among multiple locations, possibly located thousands of miles apart geographically. Although such distribution of resources can lead to benefits, it also comes with its associated problems such as rampant duplication of file transfers increasing congestion, long job completion times, unexpected site crashing, suboptimal data transfer rates, unpredictable reliability in a time range, and suboptimal usage of storage elements. In addition, each sub-system becomes a potential failure node that can trigger system wide disruptions. In this vision paper, we outline our approach to leveraging Deep Learning algorithms to discover solutions to unique problems that arise in a system with computational infrastructure that is spread over a wide area. The presented vision, motivated by a real scientific use case from Belle II experiments, is to develop multilayer neural networks to tackle forecasting, anomaly detection and optimization challenges in a complex and distributed data movement environment. Through this vision based on Deep Learning principles, we aim to achieve reduced congestion events, faster file transfer rates, and enhanced site reliability.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123483799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"3D Scientific Visualisation of 19th Century Glass Replicas of Invertebrates","authors":"D. Abate, S. Hermon, S. Lotti, G. Innocenti","doi":"10.1109/eScience.2017.87","DOIUrl":"https://doi.org/10.1109/eScience.2017.87","url":null,"abstract":"The Leopold and Rudolf Blaschka replicas of ma- rine invertebrates are among the more famous objects produced in the XIXth century for pedagogical purposes. Their high level of detail, still unknown manufacturing technique and their con- ceived artistic beauty, make these glass replicas precious museum exhibits currently. The fragility of these items produced by the Blaschka family (father and son) poses challenges for their preservation, study and presentation to the public. Moreover, their complex reflectivity and materiality (painted glass, coloured glass, glass and metal, etc.) presents a particularly interesting 3D digitisation challenge. The paper reports first results of such doumentation, performed on selected artefacts from the collections of the Fondazione Scienza e Tecnica in Florence, Italy and the zoological Section \"La Specola\" of the Natural History Muse- um of the University of Florence, both inaccessible to the public nowadays. The method applied consists of portable photogram- metry, using polarised light sources. The obtained 3D models are accurate digital surrogates of the originals, returning geometrical features and precise color information, and can be used for scientific purposes (measurements and visual observations), pedagogical purposes or presentation to the wide public in virtual exhibitions.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114712866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Large Scholarly Corpus: A Bird's-Eye View","authors":"Y. Najaflou, K. Bubendorfer","doi":"10.1109/eScience.2017.75","DOIUrl":"https://doi.org/10.1109/eScience.2017.75","url":null,"abstract":"In this paper we present a new, very large, rich, Comprehensive Scholarly Corpus (CompScholarCorp) as a platform and data source for future research. Our corpus contains records of 1,044,454 papers, 472,365 unique authors, and substantial publication meta-data for each record. We have integrated the data we collected from 276 publishers using a uniform and consistent XML data format within the corpus. The corpus is designed to be compatible with DBLP enabling existing research to utilise our new corpus directly. As an initial analysis of the corpus, we present a number of visualisations of the corpus to better understand the data, provide some analytics of the data, and present a rule-of-thumb we have observed for citations.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127922301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tekin Bicer, D. Gürsoy, R. Kettimuthu, Ian T Foster, Bin Ren, V. Andrade, F. Carlo
{"title":"Real-Time Data Analysis and Autonomous Steering of Synchrotron Light Source Experiments","authors":"Tekin Bicer, D. Gürsoy, R. Kettimuthu, Ian T Foster, Bin Ren, V. Andrade, F. Carlo","doi":"10.1109/eScience.2017.53","DOIUrl":"https://doi.org/10.1109/eScience.2017.53","url":null,"abstract":"Modern scientific instruments, such as detectors at synchrotron light sources, can generate data at 10s of GB/sec. Current experimental protocols typically process and validate data only after an experiment has completed, which can lead to undetected errors and prevents online steering. Real-time data analysis can enable both detection of, and recovery from, errors, and optimization of data acquisition. We thus propose an autonomous stream processing system that allows data streamed from beamline computers to be processed in real time on a remote supercomputer, with a control feed-back loop used to make decisions during experimentation. We evaluate our system using two iterative tomographic reconstruction algorithms and varying data generation rates. These experiments are performed in a real-world environment in which data are streamed from a light source to a cluster for analysis and experimental control. We demonstrate that our system can sustain analysis rates of hundreds of projections per second by using up to 1,200 cores, while meeting stringent data quality constraints.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"17 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115058478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
José Manuél Gómez-Pérez, C. Meertens, F. Boler, H. Loescher, C. Laney, Daniel Crawl, I. Altintas
{"title":"Research Objects for Interworkability among Global Environmental and Geophysical Data","authors":"José Manuél Gómez-Pérez, C. Meertens, F. Boler, H. Loescher, C. Laney, Daniel Crawl, I. Altintas","doi":"10.1109/eScience.2017.67","DOIUrl":"https://doi.org/10.1109/eScience.2017.67","url":null,"abstract":"The provisioning and exploitation at a global scale of environmental and geophysical data requires advanced automation and governance mechanisms that enable (meta)data interoperability but also the exchange of formalized scientific concepts and methods. In this paper we introduce recent efforts in such direction, based on scientific workflows and research objects as enablers of such vision. The former enable the integration of different web services in a higher-level data processing artifact while the latter enhances governance around data product validation and consistency, result reproducibility and credit to the principal investigators and data providers. This paper provides a concise overview of our project, current status and next steps.","PeriodicalId":137652,"journal":{"name":"2017 IEEE 13th International Conference on e-Science (e-Science)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123446725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}