Mashood M. Mohsan , Basma B. Hasanen , Taimur Hassan , Muhayy Ud Din , Naoufel Werghi , Lakmal Seneviratne , Irfan Hussain
{"title":"SwishFormer for robust firmness and ripeness recognition of fruits using visual tactile imagery","authors":"Mashood M. Mohsan , Basma B. Hasanen , Taimur Hassan , Muhayy Ud Din , Naoufel Werghi , Lakmal Seneviratne , Irfan Hussain","doi":"10.1016/j.postharvbio.2025.113487","DOIUrl":null,"url":null,"abstract":"<div><div>The accurate assessment of fruit ripeness is a critical task in the agricultural industry. It affects the fruit quality, shelf-life, and consumer satisfaction. Traditional methods for estimating fruit ripeness rely on subjective human judgment and invasive sampling techniques, which are both infeasible and time-consuming. This paper presents a novel method for estimating firmness and ripeness of fruits using their palpation motion encoded within the visual tactile scans. Moreover, these tactile scans are passed to the proposed SwishFormer model coupled with Random Forest head to predict the fruits firmness, which is later used in classifying the fruits ripeness stage. SwishFormer, unlike the existing state-of-the-art models, encompasses hardswish activation as a token mixer which allows it to generate distinctive set of features from the candidate tactile scans. These rich feature representations are then fed to the Random Forest regressor to robustly estimate the fruit firmness values and the estimated firmness values are then used in accurately predicting the ripeness level of the fruits. Apart from this, SwishFormer is extensively evaluated on the proposed dataset, containing the palpation visual tactile scans, and it outperforms state-of-the-art works by achieving 4.77%, 4.09%, 13.69%, and 4.65% better performance in terms of MSE, RMSE, R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>, and MAE scores, while possessing 2.02 times less parameters, and 2.09 times lesser GMACs. Additionally, the ripeness recognition performance of the proposed system is thoroughly tested through real-world experiments using a Stretch Robot, where it achieves a success rate of 96.6%, 98.3%, and 93.3% for recognizing avocados as underripe, ripe, and overripe, respectively. To the best of our knowledge, this paper introduces a first non-destructive approach to estimate fruit firmness and ripeness using off-the-shelf vision-based tactile information. Moreover, the proposed dataset and source code of this work is available at <span><span>https://mashood3624.github.io/SwishFormer/</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":20328,"journal":{"name":"Postharvest Biology and Technology","volume":"225 ","pages":"Article 113487"},"PeriodicalIF":6.4000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Postharvest Biology and Technology","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925521425000997","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
The accurate assessment of fruit ripeness is a critical task in the agricultural industry. It affects the fruit quality, shelf-life, and consumer satisfaction. Traditional methods for estimating fruit ripeness rely on subjective human judgment and invasive sampling techniques, which are both infeasible and time-consuming. This paper presents a novel method for estimating firmness and ripeness of fruits using their palpation motion encoded within the visual tactile scans. Moreover, these tactile scans are passed to the proposed SwishFormer model coupled with Random Forest head to predict the fruits firmness, which is later used in classifying the fruits ripeness stage. SwishFormer, unlike the existing state-of-the-art models, encompasses hardswish activation as a token mixer which allows it to generate distinctive set of features from the candidate tactile scans. These rich feature representations are then fed to the Random Forest regressor to robustly estimate the fruit firmness values and the estimated firmness values are then used in accurately predicting the ripeness level of the fruits. Apart from this, SwishFormer is extensively evaluated on the proposed dataset, containing the palpation visual tactile scans, and it outperforms state-of-the-art works by achieving 4.77%, 4.09%, 13.69%, and 4.65% better performance in terms of MSE, RMSE, R, and MAE scores, while possessing 2.02 times less parameters, and 2.09 times lesser GMACs. Additionally, the ripeness recognition performance of the proposed system is thoroughly tested through real-world experiments using a Stretch Robot, where it achieves a success rate of 96.6%, 98.3%, and 93.3% for recognizing avocados as underripe, ripe, and overripe, respectively. To the best of our knowledge, this paper introduces a first non-destructive approach to estimate fruit firmness and ripeness using off-the-shelf vision-based tactile information. Moreover, the proposed dataset and source code of this work is available at https://mashood3624.github.io/SwishFormer/.
期刊介绍:
The journal is devoted exclusively to the publication of original papers, review articles and frontiers articles on biological and technological postharvest research. This includes the areas of postharvest storage, treatments and underpinning mechanisms, quality evaluation, packaging, handling and distribution of fresh horticultural crops including fruit, vegetables, flowers and nuts, but excluding grains, seeds and forages.
Papers reporting novel insights from fundamental and interdisciplinary research will be particularly encouraged. These disciplines include systems biology, bioinformatics, entomology, plant physiology, plant pathology, (bio)chemistry, engineering, modelling, and technologies for nondestructive testing.
Manuscripts on fresh food crops that will be further processed after postharvest storage, or on food processes beyond refrigeration, packaging and minimal processing will not be considered.