{"title":"低标注成本下花粉图像的深度学习:形态特征与训练预测策略的联合优化","authors":"Teng Zhang , Limi Mao","doi":"10.1016/j.revpalbo.2025.105458","DOIUrl":null,"url":null,"abstract":"<div><div>Pollen identification is of great importance in the fields of palynology, palaeoecology, botany, medicine and forensic science, but traditional microscopic morphological analysis methods are inefficient and subjective. In this study, we propose an innovative approach based on deep learning to improve the accuracy and efficiency of pollen identification. We constructed a high-quality pollen dataset containing 5521 images of 141 species and a structured attribute table containing 20 standardized morphological features. With an improved ResNet50 architecture, the model utilizes a masking mechanism to combine image features with morphological data, significantly improving classification performance. In addition, we propose a joint training strategy that utilizes both weakly labeled data (unlabeled images + some morphological features) and fully labeled data to alleviate the data scarcity problem. The experimental results show that with the introduction of morphological features, the accuracy of the model significantly improves from 83.00% to at least 89.49% and exhibits stronger generalization ability, effectively reducing overfitting. This study provides a scalable solution for automated pollen identification, addressing key challenges in data utilization and classification accuracy.</div></div>","PeriodicalId":54488,"journal":{"name":"Review of Palaeobotany and Palynology","volume":"344 ","pages":"Article 105458"},"PeriodicalIF":1.7000,"publicationDate":"2025-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep learning of pollen images under low annotation costs: joint optimization of morphological features and training and prediction strategies\",\"authors\":\"Teng Zhang , Limi Mao\",\"doi\":\"10.1016/j.revpalbo.2025.105458\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Pollen identification is of great importance in the fields of palynology, palaeoecology, botany, medicine and forensic science, but traditional microscopic morphological analysis methods are inefficient and subjective. In this study, we propose an innovative approach based on deep learning to improve the accuracy and efficiency of pollen identification. We constructed a high-quality pollen dataset containing 5521 images of 141 species and a structured attribute table containing 20 standardized morphological features. With an improved ResNet50 architecture, the model utilizes a masking mechanism to combine image features with morphological data, significantly improving classification performance. In addition, we propose a joint training strategy that utilizes both weakly labeled data (unlabeled images + some morphological features) and fully labeled data to alleviate the data scarcity problem. The experimental results show that with the introduction of morphological features, the accuracy of the model significantly improves from 83.00% to at least 89.49% and exhibits stronger generalization ability, effectively reducing overfitting. This study provides a scalable solution for automated pollen identification, addressing key challenges in data utilization and classification accuracy.</div></div>\",\"PeriodicalId\":54488,\"journal\":{\"name\":\"Review of Palaeobotany and Palynology\",\"volume\":\"344 \",\"pages\":\"Article 105458\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2025-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Review of Palaeobotany and Palynology\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0034666725001794\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PALEONTOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Review of Palaeobotany and Palynology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0034666725001794","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PALEONTOLOGY","Score":null,"Total":0}
Deep learning of pollen images under low annotation costs: joint optimization of morphological features and training and prediction strategies
Pollen identification is of great importance in the fields of palynology, palaeoecology, botany, medicine and forensic science, but traditional microscopic morphological analysis methods are inefficient and subjective. In this study, we propose an innovative approach based on deep learning to improve the accuracy and efficiency of pollen identification. We constructed a high-quality pollen dataset containing 5521 images of 141 species and a structured attribute table containing 20 standardized morphological features. With an improved ResNet50 architecture, the model utilizes a masking mechanism to combine image features with morphological data, significantly improving classification performance. In addition, we propose a joint training strategy that utilizes both weakly labeled data (unlabeled images + some morphological features) and fully labeled data to alleviate the data scarcity problem. The experimental results show that with the introduction of morphological features, the accuracy of the model significantly improves from 83.00% to at least 89.49% and exhibits stronger generalization ability, effectively reducing overfitting. This study provides a scalable solution for automated pollen identification, addressing key challenges in data utilization and classification accuracy.
期刊介绍:
The Review of Palaeobotany and Palynology is an international journal for articles in all fields of palaeobotany and palynology dealing with all groups, ranging from marine palynomorphs to higher land plants. Original contributions and comprehensive review papers should appeal to an international audience. Typical topics include but are not restricted to systematics, evolution, palaeobiology, palaeoecology, biostratigraphy, biochronology, palaeoclimatology, paleogeography, taphonomy, palaeoenvironmental reconstructions, vegetation history, and practical applications of palaeobotany and palynology, e.g. in coal and petroleum geology and archaeology. The journal especially encourages the publication of articles in which palaeobotany and palynology are applied for solving fundamental geological and biological problems as well as innovative and interdisciplinary approaches.