Proceedings of the Python in Science Conference: Latest Articles, Page 5

It's Time for the Atmospheric Science Community to ACT Together

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/majora-1b6fd038-020

A. Theisen

Citations: 0

The Pandata Scalable Open-Source Analysis Stack

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/gerudo-f2bc6f59-00b

James Bednar, Martin Durant

Citations: 0

Multi-dimensional linked-data exploration with glue

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/majora-a6455521-002

T. Robitaille

Citations: 0

Turning HPC Systems into Interactive Data Analysis Platforms using Jupyter and Dask

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/MAJORA-7DDC1DD1-01E

Anderson Banihirwe, M. Rocklin, J. Hamman, Julia Kent, Kevin Paul

Citations: 0

pyhf: a pure Python statistical fitting library for High Energy Physics with tensors and autograd

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/MAJORA-7DDC1DD1-019

M. Feickert, L. Heinrich, G. Stark, K. Cranmer

Citations: 0

Automated Annotation of Animal Vocalizations

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/MAJORA-7DDC1DD1-024

D. Nicholson

Citations: 0

vak: a neural network framework for researchers studying animal acoustic communication

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/gerudo-f2bc6f59-008

D. Nicholson, Y. Cohen

Abstract: How is speech like birdsong? What do we mean when we say an animal learns their vocalizations? Questions like these are answered by studying how animals communicate with sound. As in many other fields, the study of acoustic communication is being revolutionized by deep neural network models. These models enable answering questions that were previously impossible to address, in part because the models automate analysis of very large datasets. Acoustic communication researchers have developed multiple models for similar tasks, often implemented as research code with one of several libraries, such as Keras and PyTorch. This situation has created a real need for a framework that allows researchers to easily benchmark multiple models, and test new models, with their own data. To address this need, we developed vak (https://github.com/vocalpy/vak), a neural network framework designed for acoustic communication researchers. ("vak" is pronounced like "talk" or "squawk" and was chosen for its similarity to the Latin root voc, as in "vocal".) Here we describe the design of vak, and explain how the framework makes it easy for researchers to apply neural network models to their own data. We highlight enhancements made in version 1.0 that significantly improve user experience with the library. To provide researchers without expertise in deep learning access to these models, vak can be run via a command-line interface that uses configuration files. vak can also be used directly in scripts by scientist-coders. To achieve this, vak adapts design patterns and an API from other domain-specific PyTorch libraries such as torchvision, with modules representing neural network operations, models, datasets, and transformations for pre- and post-processing. vak also leverages the Lightning library as a backend, so that vak developers and users can focus on the domain. We provide proof-of-concept results showing how vak can be used to test new models and compare existing models from multiple model families. In closing we discuss our roadmap for development and vision for the community.

Citations: 0

libyt: a Tool for Parallel In Situ Analysis with yt

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/gerudo-f2bc6f59-011

Shin-Rong Tsai, Hsi-Yu Schive, Matthew Turk

Citations: 0

Using Blosc2 NDim As A Fast Explorer Of The Milky Way (Or Any Other NDim Dataset)

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/gerudo-f2bc6f59-000

Project Blosc, Francesc Alted, Marta Iborra, Oscar Guiñón, David Ibáñez, S. Barrachina

Abstract: Large multidimensional datasets are widely used in various engineering and scientific applications. Prompt access to the subsets of these datasets is crucial for an efficient exploration experience. To facilitate this, we have added support for large multidimensional datasets to Blosc2, a compression and format library. The extension enables effective support for large multidimensional datasets, with a special encoding of zeros that allows for efficient handling of sparse datasets. Additionally, the new two-level data partition used in Blosc2 reduces the need for decompressing unnecessary data, further accelerating slicing speed. The Blosc2 NDim layer enables the creation and reading of n-dimensional datasets in an extremely efficient manner. This is due to a completely general n-dim 2-level partitioning, which allows for slicing and dicing of arbitrarily large (and compressed) data in a more fine-grained way. Having a second partition provides greater flexibility to fit the different partitions at the different CPU cache levels, making compression even more efficient. Additionally, Blosc2 can make use of Btune, a library that automatically finds the optimal combination of compression parameters to suit user needs. Btune employs various techniques, such as a genetic algorithm and a neural network model, to discover the best parameters for a given dataset much more quickly. This approach is a significant improvement over the traditional trial-and-error method, which can take hours or even days to find the best parameters. As an example, we will demonstrate how Blosc2 NDim enables fast exploration of the Milky Way using the Gaia DR3 dataset.
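The claim that a second partition level reduces unnecessary decompression can be illustrated with a simple counting model. The sketch below is not Blosc2 code and does not use the Blosc2 API; the function names are hypothetical, and it assumes non-empty slices and chunk edges that are exact multiples of block edges. It only counts how many elements would have to be decompressed if the smallest independently readable unit were a whole chunk versus a block inside a chunk.

```python
from math import prod


def partitions_touched(starts, stops, part_shape):
    """Count the partitions with edge lengths `part_shape` that overlap
    the half-open slice [starts, stops) along every dimension."""
    return prod(
        (stop - 1) // p - start // p + 1
        for start, stop, p in zip(starts, stops, part_shape)
    )


def elements_decompressed(chunks, blocks, starts, stops):
    """Return (chunk_only, two_level): elements decompressed when whole
    chunks must be read, versus when single blocks can be read.

    Assumption (illustrative model only): every chunk edge is a multiple
    of the matching block edge, so blocks sit on a regular global grid.
    """
    chunk_only = partitions_touched(starts, stops, chunks) * prod(chunks)
    two_level = partitions_touched(starts, stops, blocks) * prod(blocks)
    return chunk_only, two_level


if __name__ == "__main__":
    # A 100 x 100 slice that straddles a chunk boundary in a 2-D array
    # partitioned into 500 x 500 chunks and 100 x 100 blocks.
    chunk_only, two_level = elements_decompressed(
        chunks=(500, 500), blocks=(100, 100),
        starts=(450, 0), stops=(550, 100),
    )
    print(chunk_only, two_level)
```

In this example the requested slice holds 10,000 elements. With chunk-only partitioning two full chunks (500,000 elements) would be decompressed, while block-level access needs only two blocks (20,000 elements).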

Citations: 0

aPhyloGeo-Covid: A Web Interface for Reproducible Phylogeographic Analysis of SARS-CoV-2 Variation using Neo4j and Snakemake

Proceedings of the Python in Science Conference Pub Date : 1900-01-01 DOI: 10.25080/gerudo-f2bc6f59-00f

Wanlin Li, Nadia Tahiri

Abstract: The gene sequencing data, along with the associated lineage tracing and research data generated throughout the Coronavirus disease 2019 (COVID-19) pandemic, constitute invaluable resources that profoundly empower phylogeography research. To optimize the utilization of these resources, we have developed an interactive analysis platform called aPhyloGeo-Covid, leveraging the capabilities of Neo4j, Snakemake, and Python. This platform enables researchers to explore and visualize diverse data sources specifically relevant to SARS-CoV-2 for phylogeographic analysis. The integrated Neo4j database acts as a comprehensive repository, consolidating COVID-19 pandemic-related sequence information, climate data, and demographic data obtained from public databases, facilitating efficient filtering and organization of input data for phylogeographical studies. Presently, the database encompasses over 113,774 nodes and 194,381 relationships. Additionally, aPhyloGeo-Covid provides a scalable and reproducible phylogeographic workflow for investigating the intricate relationship between geographic features and the patterns of variation in diverse SARS-CoV-2 variants. The code repository of the platform is publicly accessible on GitHub (https://github.com/tahiri-lab/iPhyloGeo/tree/iPhylooGeo-neo4j), providing researchers with a valuable tool to analyze and explore the intricate dynamics of SARS-CoV-2 within a phylogeographic context.

Citations: 0