Miguel López-Otal , Fernando Domínguez-Castro , Borja Latorre , Javier Vela-Tambo , Jorge Gracia
{"title":"SeqIA: A Python framework for extracting drought impacts from news archives","authors":"Miguel López-Otal , Fernando Domínguez-Castro , Borja Latorre , Javier Vela-Tambo , Jorge Gracia","doi":"10.1016/j.envsoft.2025.106382","DOIUrl":null,"url":null,"abstract":"<div><div>Drought is a hazard that causes great economic, ecological, and human loss. With an ever-growing risk of climate change, their frequency and magnitude are expected to increase. While there are many indices and metrics available for the analysis of droughts, assessing their impacts represents one of the best ways to understand their magnitude and extent. However, there are no systematic records outlining these impacts.</div><div>To help in their ongoing creation, we present a software framework that leverages raw newspaper articles, identifies any drought-related ones, and automatically classifies them according to a set of socioeconomic impacts. The information is provided to the user in a structured format, including geographical coordinates and their date of reporting. Our approach employs state-of-the-art Transformer-based Natural Language Processing (NLP) techniques, which achieve great accuracy. We currently support newspaper articles in the Spanish language within Spain, but our framework can be expanded to other countries and languages.</div></div>","PeriodicalId":310,"journal":{"name":"Environmental Modelling & Software","volume":"187 ","pages":"Article 106382"},"PeriodicalIF":4.8000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Modelling & Software","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1364815225000660","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Drought is a hazard that causes great economic, ecological, and human loss. With an ever-growing risk of climate change, their frequency and magnitude are expected to increase. While there are many indices and metrics available for the analysis of droughts, assessing their impacts represents one of the best ways to understand their magnitude and extent. However, there are no systematic records outlining these impacts.
To help in their ongoing creation, we present a software framework that leverages raw newspaper articles, identifies any drought-related ones, and automatically classifies them according to a set of socioeconomic impacts. The information is provided to the user in a structured format, including geographical coordinates and their date of reporting. Our approach employs state-of-the-art Transformer-based Natural Language Processing (NLP) techniques, which achieve great accuracy. We currently support newspaper articles in the Spanish language within Spain, but our framework can be expanded to other countries and languages.
期刊介绍:
Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.