{"title":"StarNet:用于质量和成熟度评估的印度星醋栗数据集","authors":"Pushpa B․ R․ , Manohar N․ , N. Shobha Rani","doi":"10.1016/j.dib.2025.111825","DOIUrl":null,"url":null,"abstract":"<div><div>Star gooseberry provides immense health benefits and is widely recognized in the Indian medicinal system. It holds significant importance in the food production, pharmaceuticals, and cosmetics industries due to the presence of therapeutic and pharmacological properties. Due to its beneficial properties, gooseberry fruit is widely used in treating various ailments. Therefore, cultivating these fruits presents an opportunity to generate revenue, benefiting both farmers and the agricultural sector. The post-harvest process of fruit typically performs the quality assessment by segregating fruits based on visual characteristics, which is tedious and prone to human error. Hence, there is a need to develop an automated computer vision model to assess the fruit quality more accurately. This study focuses on dataset collection, including image samples of both single and multiple-star gooseberry fruits to automate fruit grading. This dataset has been specifically developed for research purposes, contributing to fruit detection, quality assessment, weight estimation, and classification of fruits at various ripeness stages. Further, it provides researchers with an opportunity to develop an automated system for detecting overlapping fruits and touching contours using machine learning, deep learning, and computer vision systems. Image samples of star gooseberry at different growth stages were collected from orchids in Mysuru, India. The dataset, named “AmlaNet” comprises 792 image samples of star gooseberry, captured against a plain background from varying angles, sizes, brightness levels, and distances. The dataset is organized into four folders such as single star gooseberry fruit, multiple fruits, overlapped, and annotated samples of overlapped star gooseberry fruits including fruit samples with different ripeness stages. This publicly accessible dataset is expected to benefit the research community, enabling advancement in computer vision and AI Applications. It can be accessed at DOI: <span><span>10.17632/2255bdy9mm.1</span><svg><path></path></svg></span></div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"61 ","pages":"Article 111825"},"PeriodicalIF":1.4000,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"StarNet: Indian star gooseberries dataset for quality and maturity assessment\",\"authors\":\"Pushpa B․ R․ , Manohar N․ , N. Shobha Rani\",\"doi\":\"10.1016/j.dib.2025.111825\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Star gooseberry provides immense health benefits and is widely recognized in the Indian medicinal system. It holds significant importance in the food production, pharmaceuticals, and cosmetics industries due to the presence of therapeutic and pharmacological properties. Due to its beneficial properties, gooseberry fruit is widely used in treating various ailments. Therefore, cultivating these fruits presents an opportunity to generate revenue, benefiting both farmers and the agricultural sector. The post-harvest process of fruit typically performs the quality assessment by segregating fruits based on visual characteristics, which is tedious and prone to human error. Hence, there is a need to develop an automated computer vision model to assess the fruit quality more accurately. This study focuses on dataset collection, including image samples of both single and multiple-star gooseberry fruits to automate fruit grading. This dataset has been specifically developed for research purposes, contributing to fruit detection, quality assessment, weight estimation, and classification of fruits at various ripeness stages. Further, it provides researchers with an opportunity to develop an automated system for detecting overlapping fruits and touching contours using machine learning, deep learning, and computer vision systems. Image samples of star gooseberry at different growth stages were collected from orchids in Mysuru, India. The dataset, named “AmlaNet” comprises 792 image samples of star gooseberry, captured against a plain background from varying angles, sizes, brightness levels, and distances. The dataset is organized into four folders such as single star gooseberry fruit, multiple fruits, overlapped, and annotated samples of overlapped star gooseberry fruits including fruit samples with different ripeness stages. This publicly accessible dataset is expected to benefit the research community, enabling advancement in computer vision and AI Applications. It can be accessed at DOI: <span><span>10.17632/2255bdy9mm.1</span><svg><path></path></svg></span></div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"61 \",\"pages\":\"Article 111825\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340925005529\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925005529","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
StarNet: Indian star gooseberries dataset for quality and maturity assessment
Star gooseberry provides immense health benefits and is widely recognized in the Indian medicinal system. It holds significant importance in the food production, pharmaceuticals, and cosmetics industries due to the presence of therapeutic and pharmacological properties. Due to its beneficial properties, gooseberry fruit is widely used in treating various ailments. Therefore, cultivating these fruits presents an opportunity to generate revenue, benefiting both farmers and the agricultural sector. The post-harvest process of fruit typically performs the quality assessment by segregating fruits based on visual characteristics, which is tedious and prone to human error. Hence, there is a need to develop an automated computer vision model to assess the fruit quality more accurately. This study focuses on dataset collection, including image samples of both single and multiple-star gooseberry fruits to automate fruit grading. This dataset has been specifically developed for research purposes, contributing to fruit detection, quality assessment, weight estimation, and classification of fruits at various ripeness stages. Further, it provides researchers with an opportunity to develop an automated system for detecting overlapping fruits and touching contours using machine learning, deep learning, and computer vision systems. Image samples of star gooseberry at different growth stages were collected from orchids in Mysuru, India. The dataset, named “AmlaNet” comprises 792 image samples of star gooseberry, captured against a plain background from varying angles, sizes, brightness levels, and distances. The dataset is organized into four folders such as single star gooseberry fruit, multiple fruits, overlapped, and annotated samples of overlapped star gooseberry fruits including fruit samples with different ripeness stages. This publicly accessible dataset is expected to benefit the research community, enabling advancement in computer vision and AI Applications. It can be accessed at DOI: 10.17632/2255bdy9mm.1
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.