{"title":"数据集设计在开发鲁棒U-Net模型用于无标签细胞形态评估中的重要性。","authors":"Takeru Shiina , Kazue Kimura , Yuto Takemoto , Kenjiro Tanaka , Ryuji Kato","doi":"10.1016/j.jbiosc.2025.01.004","DOIUrl":null,"url":null,"abstract":"<div><div>Advances in regenerative medicine highlighted the need for label-free cell image analysis to replace conventional microscopic observation for non-invasive cell quality evaluation. Image-based evaluation provides an efficient, quantitative, and automated approach to cell analysis, but segmentation remains a critical and challenging step. In this study, we investigated how training dataset design influenced the robustness of U-Net models for cell segmentation, focusing on challenges posed by limited data availability in cell culture. Using 2592 image pairs from four cell types representing key morphological categories, we constructed 42 investigation patterns to evaluate the effects of dataset size, dataset content, and morphological diversity on model performance. Our results showed that robust segmentation models could be developed with approximately 10 raw images captured using a 4× objective lens, a much smaller dataset than typically assumed. The dataset content was found to be crucial: training dataset images that captured commonly observed cell patterns yielded more robust models compared to those capturing rare or irregular cell patterns, which often impaired model performance with large deviations. Additionally, including both spindle and round cell morphologies in the training datasets improved model robustness when tested across all four cell types, while datasets restricted to a single morphology type could not achieve robust models. These findings highlight the importance of curating datasets that capture representative yet diverse cell morphologies. By addressing critical questions about dataset design, this study provides actionable guidance for the effective use of deep learning-based cell segmentation models in manufacturing and research applications.</div></div>","PeriodicalId":15199,"journal":{"name":"Journal of bioscience and bioengineering","volume":"139 4","pages":"Pages 329-339"},"PeriodicalIF":2.3000,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Importance of dataset design in developing robust U-Net models for label-free cell morphology evaluation\",\"authors\":\"Takeru Shiina , Kazue Kimura , Yuto Takemoto , Kenjiro Tanaka , Ryuji Kato\",\"doi\":\"10.1016/j.jbiosc.2025.01.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Advances in regenerative medicine highlighted the need for label-free cell image analysis to replace conventional microscopic observation for non-invasive cell quality evaluation. Image-based evaluation provides an efficient, quantitative, and automated approach to cell analysis, but segmentation remains a critical and challenging step. In this study, we investigated how training dataset design influenced the robustness of U-Net models for cell segmentation, focusing on challenges posed by limited data availability in cell culture. Using 2592 image pairs from four cell types representing key morphological categories, we constructed 42 investigation patterns to evaluate the effects of dataset size, dataset content, and morphological diversity on model performance. Our results showed that robust segmentation models could be developed with approximately 10 raw images captured using a 4× objective lens, a much smaller dataset than typically assumed. The dataset content was found to be crucial: training dataset images that captured commonly observed cell patterns yielded more robust models compared to those capturing rare or irregular cell patterns, which often impaired model performance with large deviations. Additionally, including both spindle and round cell morphologies in the training datasets improved model robustness when tested across all four cell types, while datasets restricted to a single morphology type could not achieve robust models. These findings highlight the importance of curating datasets that capture representative yet diverse cell morphologies. By addressing critical questions about dataset design, this study provides actionable guidance for the effective use of deep learning-based cell segmentation models in manufacturing and research applications.</div></div>\",\"PeriodicalId\":15199,\"journal\":{\"name\":\"Journal of bioscience and bioengineering\",\"volume\":\"139 4\",\"pages\":\"Pages 329-339\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-02-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of bioscience and bioengineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1389172325000040\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of bioscience and bioengineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1389172325000040","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
Importance of dataset design in developing robust U-Net models for label-free cell morphology evaluation
Advances in regenerative medicine highlighted the need for label-free cell image analysis to replace conventional microscopic observation for non-invasive cell quality evaluation. Image-based evaluation provides an efficient, quantitative, and automated approach to cell analysis, but segmentation remains a critical and challenging step. In this study, we investigated how training dataset design influenced the robustness of U-Net models for cell segmentation, focusing on challenges posed by limited data availability in cell culture. Using 2592 image pairs from four cell types representing key morphological categories, we constructed 42 investigation patterns to evaluate the effects of dataset size, dataset content, and morphological diversity on model performance. Our results showed that robust segmentation models could be developed with approximately 10 raw images captured using a 4× objective lens, a much smaller dataset than typically assumed. The dataset content was found to be crucial: training dataset images that captured commonly observed cell patterns yielded more robust models compared to those capturing rare or irregular cell patterns, which often impaired model performance with large deviations. Additionally, including both spindle and round cell morphologies in the training datasets improved model robustness when tested across all four cell types, while datasets restricted to a single morphology type could not achieve robust models. These findings highlight the importance of curating datasets that capture representative yet diverse cell morphologies. By addressing critical questions about dataset design, this study provides actionable guidance for the effective use of deep learning-based cell segmentation models in manufacturing and research applications.
期刊介绍:
The Journal of Bioscience and Bioengineering is a research journal publishing original full-length research papers, reviews, and Letters to the Editor. The Journal is devoted to the advancement and dissemination of knowledge concerning fermentation technology, biochemical engineering, food technology and microbiology.