{"title":"A Hybrid Framework for Red Blood Cell Labeling Using Elliptical Fitting, Autoencoding, and Data Augmentation.","authors":"Bundasak Angmanee, Surasak Wanram, Amorn Thedsakhulwong","doi":"10.3390/jimaging11090309","DOIUrl":null,"url":null,"abstract":"<p><p>This study aimed to develop a local dataset of abnormal RBC morphology from confirmed cases of anemia and thalassemia in Thailand, providing a foundation for medical image analysis and future AI-assisted diagnostics. Blood smear samples from six hematological disorders were collected between April and May 2025, with twelve regions of interest segmented into approximately 34,000 single-cell images. To characterize cell variability, a convolutional autoencoder was applied to extract latent features, while ellipse fitting was used to quantify cell geometry. Expert hematologists validated representative clusters to ensure clinical accuracy, and data augmentation was employed to address class imbalance and expand rare morphological types. From the dataset, 14,089 high-quality single-cell images were used to classify RBC morphology into 36 clinically meaningful categories. Unlike existing datasets that rely on limited or curated samples, this dataset reflects population-specific characteristics and morphological diversity relevant to Southeast Asia. The results demonstrate the feasibility of establishing scalable and interpretable datasets that integrate computational methods with expert knowledge. The proposed dataset serves as a robust resource for advancing hematology research and contributes to bridging traditional diagnostics with AI-driven clinical support systems.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"11 9","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12470543/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/jimaging11090309","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
This study aimed to develop a local dataset of abnormal RBC morphology from confirmed cases of anemia and thalassemia in Thailand, providing a foundation for medical image analysis and future AI-assisted diagnostics. Blood smear samples from six hematological disorders were collected between April and May 2025, with twelve regions of interest segmented into approximately 34,000 single-cell images. To characterize cell variability, a convolutional autoencoder was applied to extract latent features, while ellipse fitting was used to quantify cell geometry. Expert hematologists validated representative clusters to ensure clinical accuracy, and data augmentation was employed to address class imbalance and expand rare morphological types. From the dataset, 14,089 high-quality single-cell images were used to classify RBC morphology into 36 clinically meaningful categories. Unlike existing datasets that rely on limited or curated samples, this dataset reflects population-specific characteristics and morphological diversity relevant to Southeast Asia. The results demonstrate the feasibility of establishing scalable and interpretable datasets that integrate computational methods with expert knowledge. The proposed dataset serves as a robust resource for advancing hematology research and contributes to bridging traditional diagnostics with AI-driven clinical support systems.