{"title":"Combine, assign or delete? How to resolve different levels of taxonomic identification in chironomid datasets.","authors":"Oliver Heiri, Stefan Engels","doi":"10.1007/s10933-026-00387-1","DOIUrl":null,"url":null,"abstract":"<p><p>The analysis of fossil biotic remains in lake sediment records allows for the reconstruction of past ecosystem dynamics and as such forms a powerful tool for understanding ecosystem processes and environmental change on a range of spatial and temporal scales. However, when producing palaeoecological datasets, analysts are often not able to assign all specimens to morphotypes at the highest taxonomic resolution. As a result, datasets containing unprocessed or raw counts usually include categories of identification across multiple different taxonomic levels (e.g. species morphotype, genus, tribe or even family level), also for fossil remains that may actually have originated from the same taxon (e.g. the same species or species morphotype). Whilst different strategies to deal with this problem have emerged over the years, it is rarely described in papers how analysts dealt with this issue, and how datasets were processed from raw counts to a final dataset used for palaeoecological interpretation, numerical analysis or quantitative inference. Using chironomid identifications as an example, we here describe the four main strategies for dealing with such multi-level identifications that can be applied to unprocessed count data with different levels of taxonomic detail: combining, retaining, deleting and assigning. We discuss advantages and disadvantages of each of these strategies and illustrate their impacts on palaeoecological analyses using both theoretical and practical examples. We conclude that there is no one optimal way to deal with the issue of incomplete or multi-level taxonomic identifications of fossils that may have originated from the same taxon, but that analysts will have to determine the strategy that best befits their project on a case-by-case basis, taking into account the type and quality of data they are working with, as well as the overall aims of their research project. We recommend that the choice of strategy (or combined strategies) is clearly described in the form of a brief statement in the Methods section of manuscripts where primary data are presented and that ideally both the unprocessed raw count data and the processed data used for ecological and numerical analyses are archived where publication outlets allow for this. Whilst we use chironomid research as the primary example throughout this manuscript, the proposed strategies and recommendations are relevant to a wide range of microfossil groups.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s10933-026-00387-1.</p>","PeriodicalId":16658,"journal":{"name":"Journal of Paleolimnology","volume":"74 2","pages":"6"},"PeriodicalIF":1.3000,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13035590/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Paleolimnology","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s10933-026-00387-1","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/3/30 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The analysis of fossil biotic remains in lake sediment records allows for the reconstruction of past ecosystem dynamics and as such forms a powerful tool for understanding ecosystem processes and environmental change on a range of spatial and temporal scales. However, when producing palaeoecological datasets, analysts are often not able to assign all specimens to morphotypes at the highest taxonomic resolution. As a result, datasets containing unprocessed or raw counts usually include categories of identification across multiple different taxonomic levels (e.g. species morphotype, genus, tribe or even family level), also for fossil remains that may actually have originated from the same taxon (e.g. the same species or species morphotype). Whilst different strategies to deal with this problem have emerged over the years, it is rarely described in papers how analysts dealt with this issue, and how datasets were processed from raw counts to a final dataset used for palaeoecological interpretation, numerical analysis or quantitative inference. Using chironomid identifications as an example, we here describe the four main strategies for dealing with such multi-level identifications that can be applied to unprocessed count data with different levels of taxonomic detail: combining, retaining, deleting and assigning. We discuss advantages and disadvantages of each of these strategies and illustrate their impacts on palaeoecological analyses using both theoretical and practical examples. We conclude that there is no one optimal way to deal with the issue of incomplete or multi-level taxonomic identifications of fossils that may have originated from the same taxon, but that analysts will have to determine the strategy that best befits their project on a case-by-case basis, taking into account the type and quality of data they are working with, as well as the overall aims of their research project. We recommend that the choice of strategy (or combined strategies) is clearly described in the form of a brief statement in the Methods section of manuscripts where primary data are presented and that ideally both the unprocessed raw count data and the processed data used for ecological and numerical analyses are archived where publication outlets allow for this. Whilst we use chironomid research as the primary example throughout this manuscript, the proposed strategies and recommendations are relevant to a wide range of microfossil groups.
Supplementary information: The online version contains supplementary material available at 10.1007/s10933-026-00387-1.
期刊介绍:
The realization that a historical perspective is often useful, if not essential, to the understanding of most limnological processes has resulted in the recent surge of interest in paleolimnology. The main aim of the Journal of Paleolimnology is the provision of a vehicle for the rapid dissemination of original scientific work dealing with the reconstruction of lake histories. Although the majority of papers deal with lakes, paleoenvironmental studies of river, wetland, peatland and estuary systems are also eligible for publication.
The Journal of Paleolimnology, like the subject itself, is multidisciplinary in nature, and papers are published that are concerned with all aspects (e.g. biological, chemical, physical, geological, etc.) of the reconstruction and interpretation of lake histories. Both applied and more theoretical papers are equally encouraged. The Journal of Paleolimnology will continue to be a major repository for papers dealing with climatic change, as well as other pressing topics, such as global environmental change, lake acidification, eutrophication, long-term monitoring, and other aspects of lake ontogeny. Taxonomic and methodological papers are also acceptable provided they are of relatively broad interest. New equipment designs are frequently featured. In addition to original data and ideas, the Journal of Paleolimnology also publishes review articles, commentaries and program announcements. A relevant Book Review Section is also featured.