Using shell shape analysis based on landmarks to trace the geographical origin of the common cockle (Cerastoderma edule)

IF 5.6 1区 农林科学 Q1 FOOD SCIENCE & TECHNOLOGY
{"title":"Using shell shape analysis based on landmarks to trace the geographical origin of the common cockle (Cerastoderma edule)","authors":"","doi":"10.1016/j.foodcont.2024.110662","DOIUrl":null,"url":null,"abstract":"<div><p>Determining the geographical origin of seafood is crucial for regulators and fishing industries who seek to prevent commercial fraud, enforce food safety regulations, and encourage high standards in sustainable fisheries management. The cockle, <em>Cerastoderma edule</em> (Linnaeus 1768), is a key species in estuarine ecosystems and is harvested all over Europe. Therefore, traceability tools using quick and inexpensive techniques to identify the origin of this bivalve are of paramount importance to support law enforcement. In this work, we explore the potential of using Geometric Morphometric (GM) methods to identify the geographical origin of cockle specimens. This method is based on landmarks identified in the shell to trace the origin of specimens obtained in nearby aquatic systems (from &lt;35 km to &lt;250 km distance). Specimens were collected in five aquatic systems (Ria de Aveiro, the Tagus and Sado estuaries, and the Albufeira and Óbidos coastal lagoons) in Portugal. Shells were digitalized and 16 landmarks were identified in each right valve and analyzed using Generalized Procrustes Superimposition. The discriminating power for traceability of 12 statistical and machine learning methods was assessed based on the corresponding shape variables, using R and Python (Linear Discriminant Analysis (LDA), Canonical Variable Analysis (CVA), Principal Component Analysis (PCA), Between-Group PCA (bgPCA), Partial Least Squares Discriminant (PLSD), Classification Regression Tree (CRT), Logistic Regression (LR), Random Forest (RF), Gradient Boosting (GB), K Nearest Neighbors (KNN), Support Vector Machines (SVM), Extending Gradient Boosting (XGBoost) and Neural Networks (NNET). LDA, CVA, SVM, and NNET demonstrated greater accuracy and a F1-score &gt;80%, even with a small and unbalanced sample size. The highest percentage of correctly assigned individuals was obtained in the Tagus estuary (mean 89%) and in the Albufeira lagoon (mean 93%), which were also the systems with more specimens measured (174 and 59 respectively), whereas the worst results were obtained in the Sado estuary (50%, 56 specimens). In the Albufeira coastal lagoon, the best classification methods reached 100% correct classifications. It further highlights the importance of establishing statistical standards, such as the ones developed in the current work, to evaluate different methods, as small changes in the procedure may cause substantial differences in the results and conclusions. The revision of previous works (presented as a table) showed often &gt;90% of correct classification in both bivalves and gastropods, highlighting the potential of the techniques for other mollusks. Our results support the use of GM based on landmarks as a reliable tool for bivalve's traceability, since it is a quick, simple and inexpensive approach. Further research should extend these findings to other species and other shape analysis techniques.</p></div>","PeriodicalId":319,"journal":{"name":"Food Control","volume":null,"pages":null},"PeriodicalIF":5.6000,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0956713524003797/pdfft?md5=95187035c728ff5f8c676a41bb7c89a1&pid=1-s2.0-S0956713524003797-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Food Control","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0956713524003797","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Determining the geographical origin of seafood is crucial for regulators and fishing industries who seek to prevent commercial fraud, enforce food safety regulations, and encourage high standards in sustainable fisheries management. The cockle, Cerastoderma edule (Linnaeus 1768), is a key species in estuarine ecosystems and is harvested all over Europe. Therefore, traceability tools using quick and inexpensive techniques to identify the origin of this bivalve are of paramount importance to support law enforcement. In this work, we explore the potential of using Geometric Morphometric (GM) methods to identify the geographical origin of cockle specimens. This method is based on landmarks identified in the shell to trace the origin of specimens obtained in nearby aquatic systems (from <35 km to <250 km distance). Specimens were collected in five aquatic systems (Ria de Aveiro, the Tagus and Sado estuaries, and the Albufeira and Óbidos coastal lagoons) in Portugal. Shells were digitalized and 16 landmarks were identified in each right valve and analyzed using Generalized Procrustes Superimposition. The discriminating power for traceability of 12 statistical and machine learning methods was assessed based on the corresponding shape variables, using R and Python (Linear Discriminant Analysis (LDA), Canonical Variable Analysis (CVA), Principal Component Analysis (PCA), Between-Group PCA (bgPCA), Partial Least Squares Discriminant (PLSD), Classification Regression Tree (CRT), Logistic Regression (LR), Random Forest (RF), Gradient Boosting (GB), K Nearest Neighbors (KNN), Support Vector Machines (SVM), Extending Gradient Boosting (XGBoost) and Neural Networks (NNET). LDA, CVA, SVM, and NNET demonstrated greater accuracy and a F1-score >80%, even with a small and unbalanced sample size. The highest percentage of correctly assigned individuals was obtained in the Tagus estuary (mean 89%) and in the Albufeira lagoon (mean 93%), which were also the systems with more specimens measured (174 and 59 respectively), whereas the worst results were obtained in the Sado estuary (50%, 56 specimens). In the Albufeira coastal lagoon, the best classification methods reached 100% correct classifications. It further highlights the importance of establishing statistical standards, such as the ones developed in the current work, to evaluate different methods, as small changes in the procedure may cause substantial differences in the results and conclusions. The revision of previous works (presented as a table) showed often >90% of correct classification in both bivalves and gastropods, highlighting the potential of the techniques for other mollusks. Our results support the use of GM based on landmarks as a reliable tool for bivalve's traceability, since it is a quick, simple and inexpensive approach. Further research should extend these findings to other species and other shape analysis techniques.

利用基于地标的贝壳形状分析追溯普通毛蚶(Cerastoderma edule)的地理起源
确定海产品的地理原产地对于监管机构和渔业来说至关重要,因为他们要防止商业欺诈,执行食品安全法规,鼓励高标准的可持续渔业管理。毛蚶(Cerastoderma edule,林尼厄斯 1768 年)是河口生态系统中的重要物种,在欧洲各地均有捕捞。因此,使用快速、廉价的技术来识别这种双壳贝类来源的可追溯性工具对于支持执法至关重要。在这项工作中,我们探索了使用几何形态计量(GM)方法识别毛蚶标本地理来源的可能性。该方法基于贝壳中的地标,追溯在附近水域系统(从 35 千米到 250 千米)采集的标本的来源。标本采集于葡萄牙的五个水系(阿威罗河、塔霍河和萨多河口以及阿尔布费拉和奥比多斯沿海泻湖)。对贝壳进行了数字化处理,在每个右瓣膜上确定了 16 个地标,并使用广义普罗克鲁斯叠加法进行了分析。根据相应的形状变量,使用 R 和 Python 评估了 12 种统计和机器学习方法(线性判别分析 (LDA)、典型变量分析 (CVA)、主成分分析 (PCA)、组间 PCA (bgPCA)、部分最小二乘法判别分析 (PLSD)、分类回归树 (CRT)、逻辑回归 (LR)、随机森林 (RF)、梯度提升 (GB)、K 最近邻 (KNN)、支持向量机 (SVM)、扩展梯度提升 (XGBoost) 和神经网络 (NNET)。LDA、CVA、SVM 和 NNET 显示出更高的准确率和 80% 的 F1 分数,即使样本量小且不平衡。塔霍河口(平均 89%)和阿尔布费拉泻湖(平均 93%)的个体分配正确率最高,也是测量标本较多的系统(分别为 174 个和 59 个),而萨多河口的结果最差(50%,56 个标本)。在阿尔布费拉沿海泻湖,最佳分类方法的正确率达到 100%。这进一步凸显了建立统计标准的重要性,如当前工作中制定的标准,以评估不同的方法,因为程序的微小变化可能会导致结果和结论的巨大差异。对以前工作的修订(以表格形式呈现)显示,双壳类和腹足类的分类正确率通常为 90%,这凸显了该技术在其他软体动物中的应用潜力。我们的研究结果支持使用基于地标的基因改造技术作为双壳类动物溯源的可靠工具,因为它是一种快速、简单且成本低廉的方法。进一步的研究应将这些发现扩展到其他物种和其他形状分析技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Food Control
Food Control 工程技术-食品科技
CiteScore
12.20
自引率
6.70%
发文量
758
审稿时长
33 days
期刊介绍: Food Control is an international journal that provides essential information for those involved in food safety and process control. Food Control covers the below areas that relate to food process control or to food safety of human foods: • Microbial food safety and antimicrobial systems • Mycotoxins • Hazard analysis, HACCP and food safety objectives • Risk assessment, including microbial and chemical hazards • Quality assurance • Good manufacturing practices • Food process systems design and control • Food Packaging technology and materials in contact with foods • Rapid methods of analysis and detection, including sensor technology • Codes of practice, legislation and international harmonization • Consumer issues • Education, training and research needs. The scope of Food Control is comprehensive and includes original research papers, authoritative reviews, short communications, comment articles that report on new developments in food control, and position papers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信