[Rapid identification and analysis of hemoglobin isoelectric focusing electrophoresis images based on deep learning].

Wei-Chen Ji, You-Li Tian, Hao-Dong Fu, Gen-Han Zha, Cheng-Xi Cao, Li Wei, Qiang Zhang
{"title":"[Rapid identification and analysis of hemoglobin isoelectric focusing electrophoresis images based on deep learning].","authors":"Wei-Chen Ji, You-Li Tian, Hao-Dong Fu, Gen-Han Zha, Cheng-Xi Cao, Li Wei, Qiang Zhang","doi":"10.3724/SP.J.1123.2024.05012","DOIUrl":null,"url":null,"abstract":"<p><p>Gel electrophoresis is used to separate and analyze macromolecules (such as DNA, RNA, and proteins) and their fragments, and highly reproducible and efficient automatic band-detection methods have been developed to analyze gel images. Uneven background, low contrast, lane distortion, blurred band edges, and geometric deformation pose detection-accuracy challenges during automatic band detection. In order to address these issues, various correction algorithms have been proposed; however, these algorithms rely on researcher experience to adjust and optimize parameters based on image characteristics, which introduces human error while qualitatively and quantitatively processing bands. Isoelectric focusing (IEF) gel electrophoresis separates proteins with high-resolution based on isoelectric point (pI) differences. Microarray IEF (mIEF) is used for the auxiliary diagnosis of diabetes and adult <i>β</i>-thalassemia owing to operational ease, low sample consumption, and high throughout. This diagnostic method relies on accurately positioning and precisely determining protein bands. To avoid errors associated with correction algorithms during band analysis, this paper introduces a method for rapidly recognizing bands in gel electrophoresis patterns that relies on a deep learning object detection algorithm, and uses it to quantify and classify the IEF electrophoresis pattern of hemoglobin (Hb). We used mIEF experiments to collect 1 665 pI-marker-free Hb IEF images as a model dataset to train the YOLOv8 model. The trained model accepts a Hb IEF image as input and infers band bounding boxes and classification results. Using inference data, the gray intensities of the pixels in each band area are summed to determine the content of each protein. The background and foreground of the image need to be separated prior to summing the abovementioned gray intensities, and the threshold method is used to achieve this. The threshold is defined as the average intensity of the background area, which is obtained by summing and averaging the background intensities of gel areas between the detection bounding boxes of each protein band. The baseline band areas are unified after removing the background. This method only requires the input image, directly outputs the corresponding electrophoretic band information, and does not rely on the experience of professionals nor is it affected by factors such as lane distortion or band deformation. In addition, the developed method does not depend on pI markers for qualitatively determining bands, thereby reducing experimental costs and improving detection efficiency. YOLOv8n delivered a detection accuracy of 92.9% and an inference time of 0.6 ms while using limited computing resources. Using Hb A2 as an example, we compared its content measured using the developed method with clinical data. The quantitative results were subjected to regression analysis, which delivered a linearity of 0.981 2 and a correlation coefficient of 0.980 0. We also used the Bland-Altman analysis method to verify that these two values are highly consistent. Compared with the traditional automatic band detection methods, the method developed in this study is fast, accurate, more repeatable, and stable, and can be used to determine the Hb A2 content in clinical practice, thereby potentially assisting in the auxiliary diagnosis of adult <i>β</i>-thalassemia.</p>","PeriodicalId":101336,"journal":{"name":"Se pu = Chinese journal of chromatography","volume":"43 6","pages":"696-704"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12093207/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Se pu = Chinese journal of chromatography","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3724/SP.J.1123.2024.05012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Gel electrophoresis is used to separate and analyze macromolecules (such as DNA, RNA, and proteins) and their fragments, and highly reproducible and efficient automatic band-detection methods have been developed to analyze gel images. Uneven background, low contrast, lane distortion, blurred band edges, and geometric deformation pose detection-accuracy challenges during automatic band detection. In order to address these issues, various correction algorithms have been proposed; however, these algorithms rely on researcher experience to adjust and optimize parameters based on image characteristics, which introduces human error while qualitatively and quantitatively processing bands. Isoelectric focusing (IEF) gel electrophoresis separates proteins with high-resolution based on isoelectric point (pI) differences. Microarray IEF (mIEF) is used for the auxiliary diagnosis of diabetes and adult β-thalassemia owing to operational ease, low sample consumption, and high throughout. This diagnostic method relies on accurately positioning and precisely determining protein bands. To avoid errors associated with correction algorithms during band analysis, this paper introduces a method for rapidly recognizing bands in gel electrophoresis patterns that relies on a deep learning object detection algorithm, and uses it to quantify and classify the IEF electrophoresis pattern of hemoglobin (Hb). We used mIEF experiments to collect 1 665 pI-marker-free Hb IEF images as a model dataset to train the YOLOv8 model. The trained model accepts a Hb IEF image as input and infers band bounding boxes and classification results. Using inference data, the gray intensities of the pixels in each band area are summed to determine the content of each protein. The background and foreground of the image need to be separated prior to summing the abovementioned gray intensities, and the threshold method is used to achieve this. The threshold is defined as the average intensity of the background area, which is obtained by summing and averaging the background intensities of gel areas between the detection bounding boxes of each protein band. The baseline band areas are unified after removing the background. This method only requires the input image, directly outputs the corresponding electrophoretic band information, and does not rely on the experience of professionals nor is it affected by factors such as lane distortion or band deformation. In addition, the developed method does not depend on pI markers for qualitatively determining bands, thereby reducing experimental costs and improving detection efficiency. YOLOv8n delivered a detection accuracy of 92.9% and an inference time of 0.6 ms while using limited computing resources. Using Hb A2 as an example, we compared its content measured using the developed method with clinical data. The quantitative results were subjected to regression analysis, which delivered a linearity of 0.981 2 and a correlation coefficient of 0.980 0. We also used the Bland-Altman analysis method to verify that these two values are highly consistent. Compared with the traditional automatic band detection methods, the method developed in this study is fast, accurate, more repeatable, and stable, and can be used to determine the Hb A2 content in clinical practice, thereby potentially assisting in the auxiliary diagnosis of adult β-thalassemia.

Abstract Image

Abstract Image

Abstract Image

[基于深度学习的血红蛋白等电聚焦电泳图像快速识别与分析]。
凝胶电泳用于分离和分析大分子(如DNA、RNA和蛋白质)及其片段,并且开发了高重复性和高效的自动带检测方法来分析凝胶图像。背景不均匀、对比度低、车道失真、频带边缘模糊、几何变形等对自动频带检测的精度提出了挑战。为了解决这些问题,人们提出了各种校正算法,然而,这些算法依赖于研究人员的经验来调整和优化基于图像特征的参数,这在定性和定量处理波段时引入了人为误差。等电聚焦(IEF)凝胶电泳基于等电点(pI)的差异,以高分辨率分离蛋白质。微阵列IEF (mIEF)由于操作简便、样品消耗少、通用性高,被用于糖尿病和成人β-地中海贫血的辅助诊断。这种诊断方法依赖于准确定位和精确测定蛋白质带。为了避免在条带分析过程中与校正算法相关的错误,本文介绍了一种基于深度学习对象检测算法的凝胶电泳模式条带快速识别方法,并利用该方法对血红蛋白(Hb)的IEF电泳模式进行量化和分类。我们使用mIEF实验收集了1 665张无pi标记的Hb IEF图像作为模型数据集,用于训练YOLOv8模型。训练后的模型接受Hb IEF图像作为输入,并推断频带边界框和分类结果。利用推理数据,对各波段像素的灰度值求和,确定各蛋白质的含量。在对上述灰度强度求和之前,需要将图像的背景和前景分开,使用阈值法来实现这一点。阈值定义为背景区域的平均强度,将每个蛋白带的检测边界盒之间的凝胶区域背景强度相加平均。去除背景后,基线带区域统一。该方法只需要输入图像,直接输出相应的电泳带信息,不依赖专业人员的经验,也不受车道畸变或带变形等因素的影响。此外,该方法不依赖pI标记物定性确定波段,从而降低了实验成本,提高了检测效率。在有限的计算资源下,YOLOv8n的检测精度为92.9%,推断时间为0.6 ms。以血红蛋白A2为例,我们比较了其含量测量的发展方法与临床数据。定量结果进行回归分析,线性关系为0.981 2,相关系数为0.980 0。我们还使用Bland-Altman分析法验证了这两个值是高度一致的。与传统自动带检测方法相比,本研究建立的方法快速、准确、重复性好、稳定性好,可用于临床测定Hb A2含量,为成人β-地中海贫血的辅助诊断提供潜在的帮助。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信