A serum exosomal microRNA-based artificial intelligence diagnostic model for highly accurate detection of hepatocellular carcinoma

IF 24.9 1区 医学 Q1 ONCOLOGY
Jin-Seong Hwang, Sugi Lee, Gyeonghwa Kim, Hoibin Jeong, Kiyoon Kwon, Eunsun Jung, Yuna Roh, Taesang Son, Hana Lee, Moo-Seung Lee, Kyoung-Jin Oh, Hye Won Lee, Yu Rim Lee, Soo Young Park, Won Young Tak, Hyun Seung Ban, Hyun-Soo Cho, Mi-Young Son, Jang-Seong Kim, Keun Hur, Dae-Soo Kim, Tae-Su Han
{"title":"A serum exosomal microRNA-based artificial intelligence diagnostic model for highly accurate detection of hepatocellular carcinoma","authors":"Jin-Seong Hwang,&nbsp;Sugi Lee,&nbsp;Gyeonghwa Kim,&nbsp;Hoibin Jeong,&nbsp;Kiyoon Kwon,&nbsp;Eunsun Jung,&nbsp;Yuna Roh,&nbsp;Taesang Son,&nbsp;Hana Lee,&nbsp;Moo-Seung Lee,&nbsp;Kyoung-Jin Oh,&nbsp;Hye Won Lee,&nbsp;Yu Rim Lee,&nbsp;Soo Young Park,&nbsp;Won Young Tak,&nbsp;Hyun Seung Ban,&nbsp;Hyun-Soo Cho,&nbsp;Mi-Young Son,&nbsp;Jang-Seong Kim,&nbsp;Keun Hur,&nbsp;Dae-Soo Kim,&nbsp;Tae-Su Han","doi":"10.1002/cac2.70043","DOIUrl":null,"url":null,"abstract":"<p>Hepatocellular carcinoma (HCC) is a critical cancer worldwide due to its low survival rate [<span>1</span>]. In the United States, the overall 5-year survival rate of patients with HCC is 22%, which decreases sharply with cancer progression [<span>2</span>]. Early detection of HCC improves patient survival. Serum alpha-fetoprotein (AFP) is a widely used biomarker for the diagnosis of HCC, but it is often elevated in patients with cirrhosis, resulting in false-positive results [<span>3</span>]. Diagnostic markers for early detection of HCC have been investigated previously [<span>4</span>], but none are widely applied in clinical settings. HCC pathogenesis is closely associated with hepatitis B and C virus (HBV and HCV) infections, which induce chronic inflammation, leading to cirrhosis and elevating the risk of malignant transformation [<span>5</span>]. Environmental and lifestyle factors, such as diet and alcohol consumption, drive the progression from steatosis to fibrosis, cirrhosis, and eventually HCC [<span>6</span>]. Due to HCC's multifactorial etiology and prolonged progression, identifying early diagnostic biomarkers remains a challenge [<span>7</span>]. Thus, integrating analyses of both pre-HCC and cancerous samples is essential for developing robust early detection strategies. This study aimed to: (1) establish stepwise animal models for HCC-related conditions including non-alcoholic steatohepatitis (NASH) and fibrosis [<span>8</span>]; (2) identify exosomal microRNA (exo-miRNA) signatures for early HCC diagnosis; and (3) develop and validate an artificial intelligence (AI)-based multi-marker model combining exo-miRNAs and AFP levels for accurate HCC diagnosis using clinical samples (Supplementary Figure S1, Supplementary Materials and Methods).</p><p>Initially, stepwise mouse models of liver diseases were developed (Figure 1A-C, Supplementary Figure S2A-B). Highly similar gene expression patterns between mouse and human liver diseases were discovered using transcriptome analysis of liver tissues and comparison with public databases (Supplementary Figure S2C). The serum exosomes were then isolated and characterized (Figure 1D, Supplementary Figure S2D), followed by exo-miRNA profiling using Nanostring analysis. This profiling was conducted on mouse models of liver diseases and human samples, which included healthy individuals (<i>n</i> = 7), patients with cirrhosis (<i>n</i> = 6), and patients with HCC (<i>n</i> = 18) (profiling set; Supplementary Table S1).</p><p>The selection criteria for exo-miRNAs included the upregulation of exo-miRNAs in serum exosomes from mice or humans with liver diseases compared with levels in exosomes from normal mice or healthy controls. Four exo-miRNAs were upregulated in samples from mice or humans with HCC compared with those from normal mice or healthy controls. Subsequently, additional criteria were applied to distinguish between cirrhosis and HCC using the mouse model, resulting in 4 additional exo-miRNAs. Eight exo-miRNAs were finally identified: miR-22-3p, miR-30a-5p, miR-30e-5p, miR-122-5p, miR-192-5p, miR-432-5p, miR-483-5p, and miR-574-5p (Figure 1E, Supplementary Figure S3). These 8 exo-miRNAs were significantly upregulated in the serum exosomes from patients with HCC compared with those from healthy controls, as confirmed by public data analysis (Supplementary Figure S4).</p><p>To validate the expression of 8 exo-miRNAs in human serum, quantitative PCR (qPCR) was performed on a training set (<i>n</i> = 195) including healthy controls and patients with cirrhosis and with HCC (training set; Supplementary Table S2). Five exo-miRNAs (miR-22-3p, miR-122-5p, miR-192-5p, miR-483-5p, and miR-574-5p) were significantly upregulated in patients with HCC compared to controls, with exo-miR122-5p showing the highest area under the curve (AUC; 0.95). All exo-miRNAs were significantly elevated in patients with HCC compared to patients with cirrhosis, with AUC values of 0.72-0.88 (Supplementary Figure S5A, Figure 1F). Validation in a new cohort (<i>n</i> = 175; Supplementary Table S3) confirmed that all exo-miRNAs were significantly upregulated in HCC patients compared to healthy controls, with miR-122-5p exhibiting the highest AUC (0.99). Five exo-miRNAs were significantly elevated in patients with HCC compared to patients with hepatitis and cirrhosis (Supplementary Figure S5B-C, Figure 1G). Whereas individual exo-miRNAs demonstrated strong diagnostic potential, combining them into a multi-marker approach was essential to address cohort variability and enhance diagnostic accuracy for HCC (Figure 1H).</p><p>To develop a robust diagnostic approach for HCC, AI-based deep-learning was used to construct a multi-marker model incorporating exo-miRNAs and AFP levels (Figure 1I). Prior to model development, batch effect correction was applied to the qPCR data to ensure consistency (Supplementary Figure S6). The model was developed using a training set comprising samples from healthy controls, patients with cirrhosis, HCC, gastric, and colorectal cancer samples (Supplementary Table S2). Diagnostic groups were formed based on AFP levels, 4 exo-miRNAs (miRNA set 1: miR-22-3p, miR-30a-5p, miR-122-5p, and miR-192-5p) with or without AFP levels, 4 mouse-specific exo-miRNAs (miRNA set 2: miR-30e-5p, miR-432-5p, miR-483-5p, and miR-574-5p) with or without AFP levels, and a total of 8 exo-miRNAs (miRNA set 3: combining miRNA set 1 and set 2) with or without AFP levels. The AI model classified the samples into three categories: control, cirrhosis, and HCC. Receiver operating characteristic (ROC) curve analysis indicated that AFP alone could effectively differentiate HCC from gastric and colorectal cancers (AUC &gt; 0.96); combining AFP with miRNA set 1 or set 3 improved performances (AUC = 1.00) (Supplementary Figure S7).</p><p>To distinguish healthy controls from patients with HCC, AFP alone achieved moderate diagnostic performance (training set, AUC = 0.94; validation set, AUC = 0.90), whereas miRNA set 1 or set 3 combined with AFP substantially enhanced accuracy (training set, AUC &gt; 0.97; validation set, AUC &gt; 0.95) (Figure 1J, Supplementary Figure S8A, Supplementary Table S4). The miRNA set 3 combined with AFP demonstrated superior diagnostic capability for early-stage HCC (AUC &gt; 0.94), surpassing AFP alone (AUC &gt; 0.89) in both the training and validation sets (Figure 1K, Supplementary Figure S8B, Supplementary Table S5). Using hepatitis or NASH as a control, miRNA set 3 achieved an AUC of 0.96 and 0.94, respectively, which was higher than miRNA set 1 (AUC = 0.73 and 0.69, respectively) (Supplementary Figure S8C). To distinguish cirrhosis from HCC, AFP alone and its combination with miRNA set 1 yielded low performance (validation sets, AUC = 0.57 and 0.49, respectively). However, the miRNA set 3 with AFP achieved high performance (training set, AUC = 1.00; validation set, AUC = 0.90) (Figure 1L, Supplementary Figure S8D, Supplementary Table S6). The miRNA set 3 with AFP consistently outperformed AFP alone for early-stage HCC detection and differentiation from cirrhosis, highlighting its potential as an effective diagnostic tool for HCC (Figure 1M, Supplementary Figure S8E, Supplementary Table S7).</p><p>To mitigate overfitting, the synthetic minority oversampling technique (SMOTE) was applied to the miRNA set 3 with AFP model. To distinguish HCC from controls, the AUC values were 0.97 (training) and 0.96 (validation); for cirrhosis versus HCC, the AUC values were 0.97 (training) and 0.93 (validation). These results matched those from the original dataset, demonstrating the model's robustness (Figure 1N, Supplementary Table S8). The miRNA set 3 was compared with 2 previously reported panels [<span>9, 10</span>] using public data that included next-generation sequencing and AFP level (GSE83977). When combined with AFP levels, our miRNA set 3 achieved superior diagnostic accuracy (AUC = 1.00 for Raw data and 0.97 for SMOTE) compared to the three-miRNA panel (AUC = 0.83 for Raw, 0.82 for SMOTE), and the alternative 8-miRNA panel (AUC = 0.87 for Raw, 0.77 for SMOTE). The AI-based, multi-marker model that incorporated 8 exo-miRNAs with AFP demonstrated the highest diagnostic accuracy for HCC across both original and augmented sample sizes (Figure 1O). This superior performance was also observed in the logistic regression-based diagnostic model, where the combination of miRNA set 3 with AFP achieved high diagnostic accuracy (Supplementary Figure S9).</p><p>In conclusion, a comprehensive biomarker discovery process was conducted using mouse models of liver disease and human serum samples. The use of mouse models in the diagnostic marker selection process identified valuable markers that might have been excluded when analyzing only human samples. The AI-based, multi-marker model that incorporated a combination of 8 exo-miRNAs with AFP level demonstrated high diagnostic performance, effectively distinguishing between a normal condition and early-stage HCC, and between cirrhosis and HCC. Previous studies have investigated blood-based diagnostics for HCC, highlighting the promise of liquid biopsy techniques [<span>11, 12</span>]. Expanding on this, our AI-based diagnostic model incorporating an exo-miRNA signature exhibits potential as a non-invasive detection tool with high accuracy and can be universally applied to various HCC types. However, to translate these strengths of our study into clinical applications, several challenges need to be addressed through further research. Specifically, (1) simplifying the exosome extraction process, (2) enhancing the AI diagnostic accuracy by incorporating diverse clinical samples from a broader population, and (3) conducting large-scale, multi-center validation studies to enhance the generalizability of the AI-based exo-miRNA diagnostic model across diverse populations.</p><p>Jin-Seong Hwang, Sugi Lee, Kiyoon Kwon, Gyeonghwa Kim, Hoibin Jeong, and Hyun-Soo Cho performed the experiments and collected the data. Sugi Lee, Dae-Soo Kim, Tae-Su Han, and Kiyoon Kwon analyzed the bioinformatics data. Eunsun Jung, Yuna Roh, Taesang Son, Hana Lee, Moo-Seung Lee, Kyoung-Jin Oh, Hye Won Lee, Yu Rim Lee, Soo Young Park, Won Young Tak, and Hyun Seung Ban analyzed and interpreted the data. Tae-Su Han, Dae-Soo Kim, Keun Hur, Jang-Seong Kim, and Mi-Young Son drafted and revised the manuscript. Tae-Su Han, Dae-Soo Kim, Keun Hur, Jang-Seong Kim, and Mi-Young Son confirmed the authenticity of all the raw data. All authors have read and approved the final manuscript.</p><p>This research was supported by grants from the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT, and Future Planning (NRF-2020R1C1C1007431, NRF-2022R1A2C1003118, RS-2024-00341766, RS-2025-00514590, and NRF-2021R1A5A2021614), the Korean Fund for Regenerative Medicine (KFRM) grant funded by the Korean government (the Ministry of Science and ICT, the Ministry of Health &amp; Welfare, 21A0404L1), and the KRIBB Research Initiative Program.</p><p>The authors declare no competing interests.</p><p>This study was approved by the Public Institutional Review Board of the Ministry of Health and Welfare of the Republic of Korea (P01-202105-31-011) and the Ethics Committee of Kyungpook National University Hospital (#KNUH-2014-04-056-001). All patients provided written informed consent prior to sample collection. The animal protocol was approved by the Committee on Animal Experimentation of the Korea Research Institute of Bioscience and Biotechnology (Approval No. KRIBB-AEC-19155).</p>","PeriodicalId":9495,"journal":{"name":"Cancer Communications","volume":"45 9","pages":"1188-1193"},"PeriodicalIF":24.9000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cac2.70043","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Communications","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cac2.70043","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Hepatocellular carcinoma (HCC) is a critical cancer worldwide due to its low survival rate [1]. In the United States, the overall 5-year survival rate of patients with HCC is 22%, which decreases sharply with cancer progression [2]. Early detection of HCC improves patient survival. Serum alpha-fetoprotein (AFP) is a widely used biomarker for the diagnosis of HCC, but it is often elevated in patients with cirrhosis, resulting in false-positive results [3]. Diagnostic markers for early detection of HCC have been investigated previously [4], but none are widely applied in clinical settings. HCC pathogenesis is closely associated with hepatitis B and C virus (HBV and HCV) infections, which induce chronic inflammation, leading to cirrhosis and elevating the risk of malignant transformation [5]. Environmental and lifestyle factors, such as diet and alcohol consumption, drive the progression from steatosis to fibrosis, cirrhosis, and eventually HCC [6]. Due to HCC's multifactorial etiology and prolonged progression, identifying early diagnostic biomarkers remains a challenge [7]. Thus, integrating analyses of both pre-HCC and cancerous samples is essential for developing robust early detection strategies. This study aimed to: (1) establish stepwise animal models for HCC-related conditions including non-alcoholic steatohepatitis (NASH) and fibrosis [8]; (2) identify exosomal microRNA (exo-miRNA) signatures for early HCC diagnosis; and (3) develop and validate an artificial intelligence (AI)-based multi-marker model combining exo-miRNAs and AFP levels for accurate HCC diagnosis using clinical samples (Supplementary Figure S1, Supplementary Materials and Methods).

Initially, stepwise mouse models of liver diseases were developed (Figure 1A-C, Supplementary Figure S2A-B). Highly similar gene expression patterns between mouse and human liver diseases were discovered using transcriptome analysis of liver tissues and comparison with public databases (Supplementary Figure S2C). The serum exosomes were then isolated and characterized (Figure 1D, Supplementary Figure S2D), followed by exo-miRNA profiling using Nanostring analysis. This profiling was conducted on mouse models of liver diseases and human samples, which included healthy individuals (n = 7), patients with cirrhosis (n = 6), and patients with HCC (n = 18) (profiling set; Supplementary Table S1).

The selection criteria for exo-miRNAs included the upregulation of exo-miRNAs in serum exosomes from mice or humans with liver diseases compared with levels in exosomes from normal mice or healthy controls. Four exo-miRNAs were upregulated in samples from mice or humans with HCC compared with those from normal mice or healthy controls. Subsequently, additional criteria were applied to distinguish between cirrhosis and HCC using the mouse model, resulting in 4 additional exo-miRNAs. Eight exo-miRNAs were finally identified: miR-22-3p, miR-30a-5p, miR-30e-5p, miR-122-5p, miR-192-5p, miR-432-5p, miR-483-5p, and miR-574-5p (Figure 1E, Supplementary Figure S3). These 8 exo-miRNAs were significantly upregulated in the serum exosomes from patients with HCC compared with those from healthy controls, as confirmed by public data analysis (Supplementary Figure S4).

To validate the expression of 8 exo-miRNAs in human serum, quantitative PCR (qPCR) was performed on a training set (n = 195) including healthy controls and patients with cirrhosis and with HCC (training set; Supplementary Table S2). Five exo-miRNAs (miR-22-3p, miR-122-5p, miR-192-5p, miR-483-5p, and miR-574-5p) were significantly upregulated in patients with HCC compared to controls, with exo-miR122-5p showing the highest area under the curve (AUC; 0.95). All exo-miRNAs were significantly elevated in patients with HCC compared to patients with cirrhosis, with AUC values of 0.72-0.88 (Supplementary Figure S5A, Figure 1F). Validation in a new cohort (n = 175; Supplementary Table S3) confirmed that all exo-miRNAs were significantly upregulated in HCC patients compared to healthy controls, with miR-122-5p exhibiting the highest AUC (0.99). Five exo-miRNAs were significantly elevated in patients with HCC compared to patients with hepatitis and cirrhosis (Supplementary Figure S5B-C, Figure 1G). Whereas individual exo-miRNAs demonstrated strong diagnostic potential, combining them into a multi-marker approach was essential to address cohort variability and enhance diagnostic accuracy for HCC (Figure 1H).

To develop a robust diagnostic approach for HCC, AI-based deep-learning was used to construct a multi-marker model incorporating exo-miRNAs and AFP levels (Figure 1I). Prior to model development, batch effect correction was applied to the qPCR data to ensure consistency (Supplementary Figure S6). The model was developed using a training set comprising samples from healthy controls, patients with cirrhosis, HCC, gastric, and colorectal cancer samples (Supplementary Table S2). Diagnostic groups were formed based on AFP levels, 4 exo-miRNAs (miRNA set 1: miR-22-3p, miR-30a-5p, miR-122-5p, and miR-192-5p) with or without AFP levels, 4 mouse-specific exo-miRNAs (miRNA set 2: miR-30e-5p, miR-432-5p, miR-483-5p, and miR-574-5p) with or without AFP levels, and a total of 8 exo-miRNAs (miRNA set 3: combining miRNA set 1 and set 2) with or without AFP levels. The AI model classified the samples into three categories: control, cirrhosis, and HCC. Receiver operating characteristic (ROC) curve analysis indicated that AFP alone could effectively differentiate HCC from gastric and colorectal cancers (AUC > 0.96); combining AFP with miRNA set 1 or set 3 improved performances (AUC = 1.00) (Supplementary Figure S7).

To distinguish healthy controls from patients with HCC, AFP alone achieved moderate diagnostic performance (training set, AUC = 0.94; validation set, AUC = 0.90), whereas miRNA set 1 or set 3 combined with AFP substantially enhanced accuracy (training set, AUC > 0.97; validation set, AUC > 0.95) (Figure 1J, Supplementary Figure S8A, Supplementary Table S4). The miRNA set 3 combined with AFP demonstrated superior diagnostic capability for early-stage HCC (AUC > 0.94), surpassing AFP alone (AUC > 0.89) in both the training and validation sets (Figure 1K, Supplementary Figure S8B, Supplementary Table S5). Using hepatitis or NASH as a control, miRNA set 3 achieved an AUC of 0.96 and 0.94, respectively, which was higher than miRNA set 1 (AUC = 0.73 and 0.69, respectively) (Supplementary Figure S8C). To distinguish cirrhosis from HCC, AFP alone and its combination with miRNA set 1 yielded low performance (validation sets, AUC = 0.57 and 0.49, respectively). However, the miRNA set 3 with AFP achieved high performance (training set, AUC = 1.00; validation set, AUC = 0.90) (Figure 1L, Supplementary Figure S8D, Supplementary Table S6). The miRNA set 3 with AFP consistently outperformed AFP alone for early-stage HCC detection and differentiation from cirrhosis, highlighting its potential as an effective diagnostic tool for HCC (Figure 1M, Supplementary Figure S8E, Supplementary Table S7).

To mitigate overfitting, the synthetic minority oversampling technique (SMOTE) was applied to the miRNA set 3 with AFP model. To distinguish HCC from controls, the AUC values were 0.97 (training) and 0.96 (validation); for cirrhosis versus HCC, the AUC values were 0.97 (training) and 0.93 (validation). These results matched those from the original dataset, demonstrating the model's robustness (Figure 1N, Supplementary Table S8). The miRNA set 3 was compared with 2 previously reported panels [9, 10] using public data that included next-generation sequencing and AFP level (GSE83977). When combined with AFP levels, our miRNA set 3 achieved superior diagnostic accuracy (AUC = 1.00 for Raw data and 0.97 for SMOTE) compared to the three-miRNA panel (AUC = 0.83 for Raw, 0.82 for SMOTE), and the alternative 8-miRNA panel (AUC = 0.87 for Raw, 0.77 for SMOTE). The AI-based, multi-marker model that incorporated 8 exo-miRNAs with AFP demonstrated the highest diagnostic accuracy for HCC across both original and augmented sample sizes (Figure 1O). This superior performance was also observed in the logistic regression-based diagnostic model, where the combination of miRNA set 3 with AFP achieved high diagnostic accuracy (Supplementary Figure S9).

In conclusion, a comprehensive biomarker discovery process was conducted using mouse models of liver disease and human serum samples. The use of mouse models in the diagnostic marker selection process identified valuable markers that might have been excluded when analyzing only human samples. The AI-based, multi-marker model that incorporated a combination of 8 exo-miRNAs with AFP level demonstrated high diagnostic performance, effectively distinguishing between a normal condition and early-stage HCC, and between cirrhosis and HCC. Previous studies have investigated blood-based diagnostics for HCC, highlighting the promise of liquid biopsy techniques [11, 12]. Expanding on this, our AI-based diagnostic model incorporating an exo-miRNA signature exhibits potential as a non-invasive detection tool with high accuracy and can be universally applied to various HCC types. However, to translate these strengths of our study into clinical applications, several challenges need to be addressed through further research. Specifically, (1) simplifying the exosome extraction process, (2) enhancing the AI diagnostic accuracy by incorporating diverse clinical samples from a broader population, and (3) conducting large-scale, multi-center validation studies to enhance the generalizability of the AI-based exo-miRNA diagnostic model across diverse populations.

Jin-Seong Hwang, Sugi Lee, Kiyoon Kwon, Gyeonghwa Kim, Hoibin Jeong, and Hyun-Soo Cho performed the experiments and collected the data. Sugi Lee, Dae-Soo Kim, Tae-Su Han, and Kiyoon Kwon analyzed the bioinformatics data. Eunsun Jung, Yuna Roh, Taesang Son, Hana Lee, Moo-Seung Lee, Kyoung-Jin Oh, Hye Won Lee, Yu Rim Lee, Soo Young Park, Won Young Tak, and Hyun Seung Ban analyzed and interpreted the data. Tae-Su Han, Dae-Soo Kim, Keun Hur, Jang-Seong Kim, and Mi-Young Son drafted and revised the manuscript. Tae-Su Han, Dae-Soo Kim, Keun Hur, Jang-Seong Kim, and Mi-Young Son confirmed the authenticity of all the raw data. All authors have read and approved the final manuscript.

This research was supported by grants from the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT, and Future Planning (NRF-2020R1C1C1007431, NRF-2022R1A2C1003118, RS-2024-00341766, RS-2025-00514590, and NRF-2021R1A5A2021614), the Korean Fund for Regenerative Medicine (KFRM) grant funded by the Korean government (the Ministry of Science and ICT, the Ministry of Health & Welfare, 21A0404L1), and the KRIBB Research Initiative Program.

The authors declare no competing interests.

This study was approved by the Public Institutional Review Board of the Ministry of Health and Welfare of the Republic of Korea (P01-202105-31-011) and the Ethics Committee of Kyungpook National University Hospital (#KNUH-2014-04-056-001). All patients provided written informed consent prior to sample collection. The animal protocol was approved by the Committee on Animal Experimentation of the Korea Research Institute of Bioscience and Biotechnology (Approval No. KRIBB-AEC-19155).

Abstract Image

一种基于血清外泌体微rna的人工智能诊断模型,用于高精度检测肝细胞癌。
肝细胞癌(HCC)因其低生存率而成为世界范围内的一种危重癌症。在美国,HCC患者的总体5年生存率为22%,随着癌症进展,这一生存率急剧下降。早期发现HCC可提高患者生存率。血清甲胎蛋白(AFP)是一种广泛用于HCC诊断的生物标志物,但它在肝硬化患者中经常升高,导致假阳性结果[3]。早期发现HCC的诊断标志物已经被研究过,但没有一个在临床环境中广泛应用。HCC的发病机制与乙型和丙型肝炎病毒(HBV和HCV)感染密切相关,乙型和丙型肝炎病毒感染可诱发慢性炎症,导致肝硬化并增加恶性转化[5]的风险。环境和生活方式因素,如饮食和饮酒,会导致脂肪变性发展为纤维化、肝硬化,并最终导致肝细胞癌。由于HCC的多因素病因和长期进展,确定早期诊断的生物标志物仍然是一个挑战。因此,整合hcc前和癌样的分析对于制定强有力的早期检测策略至关重要。本研究旨在:(1)逐步建立hcc相关疾病的动物模型,包括非酒精性脂肪性肝炎(NASH)和纤维化[8];(2)鉴别早期HCC诊断的外泌体microRNA (exo-miRNA)特征;(3)开发并验证一种基于人工智能(AI)的多标记模型,结合exo- mirna和AFP水平,用于临床样本的HCC准确诊断(补充图S1,补充材料和方法)。首先,逐步建立小鼠肝脏疾病模型(图1A-C,补充图S2A-B)。通过肝脏组织转录组分析和与公共数据库的比较,发现小鼠和人类肝脏疾病之间高度相似的基因表达模式(补充图S2C)。然后分离血清外泌体并进行表征(图1D,补充图S2D),然后使用纳米链分析进行外显子mirna分析。对肝脏疾病小鼠模型和人类样本进行分析,包括健康个体(n = 7)、肝硬化患者(n = 6)和HCC患者(n = 18)(分析集;补充表S1)。外显mirna的选择标准包括与正常小鼠或健康对照组的外泌体相比,患有肝脏疾病的小鼠或人类血清外泌体中的外显mirna水平上调。与正常小鼠或健康对照相比,来自HCC小鼠或人类的样本中有四种外显mirna上调。随后,使用小鼠模型应用其他标准来区分肝硬化和HCC,产生4个额外的外显子mirna。最终鉴定出8个外展mirna: miR-22-3p、miR-30a-5p、miR-30e-5p、miR-122-5p、miR-192-5p、miR-432-5p、miR-483-5p和miR-574-5p(图1E,补充图S3)。公开数据分析证实,与健康对照组相比,HCC患者血清外泌体中这8种外显mirna显著上调(补充图S4)。为了验证8个exo- mirna在人血清中的表达,我们对一个训练集(n = 195)进行了定量PCR (qPCR),其中包括健康对照组和肝硬化和HCC患者(训练集;补充表S2)。5种exo- mirna (miR-22-3p、miR-122-5p、miR-192-5p、miR-483-5p和miR-574-5p)在HCC患者中与对照组相比显著上调,其中exo-miR122-5p曲线下面积最大(AUC; 0.95)。与肝硬化患者相比,HCC患者的所有exo- mirna均显著升高,AUC值为0.72-0.88 (Supplementary Figure S5A, Figure 1F)。一项新队列验证(n = 175;补充表S3)证实,与健康对照组相比,HCC患者中所有外显mirna均显著上调,其中miR-122-5p的AUC最高(0.99)。与肝炎和肝硬化患者相比,HCC患者中有5种外显mirna显著升高(补充图S5B-C,图1G)。尽管单个外显子mirna显示出强大的诊断潜力,但将它们结合到多标记方法中对于解决队列变异性和提高HCC诊断准确性至关重要(图1H)。为了开发一种强大的HCC诊断方法,基于人工智能的深度学习用于构建包含外显子mirna和AFP水平的多标记模型(图1I)。在建立模型之前,对qPCR数据进行批量效应校正以确保一致性(Supplementary图S6)。该模型是使用包括健康对照、肝硬化、HCC、胃癌和结直肠癌患者样本的训练集开发的(补充表S2)。 根据AFP水平、有或没有AFP水平的4个exo-miRNA (miRNA set 1: miR-22-3p、miR-30a-5p、miR-122-5p和miR-192-5p)、有或没有AFP水平的4个小鼠特异性exo-miRNA (miRNA set 2: miR-30e-5p、miR-432-5p、miR-483-5p和mir - 571 -5p)以及有或没有AFP水平的8个exo-miRNA (miRNA set 3:结合miRNA set 1和set 2)组成诊断组。人工智能模型将样本分为三类:对照组、肝硬化和HCC。受试者工作特征(ROC)曲线分析显示,单独使用AFP可有效区分肝癌与胃癌和结直肠癌(AUC &gt; 0.96);AFP与miRNA集合1或集合3联合使用可改善性能(AUC = 1.00)(补充图S7)。为了区分健康对照和HCC患者,AFP单独获得了中等的诊断效果(训练集,AUC = 0.94;验证集,AUC = 0.90),而miRNA集1或集3联合AFP显著提高了准确性(训练集,AUC &gt; 0.97;验证集,AUC &gt; 0.95)(图1J,补充图S8A,补充表S4)。miRNA集合3联合AFP对早期HCC的诊断能力更强(AUC &gt; 0.94),在训练集和验证集均超过AFP单独(AUC &gt; 0.89)(图1K,补充图S8B,补充表S5)。以肝炎或NASH为对照,miRNA集合3的AUC分别为0.96和0.94,高于miRNA集合1 (AUC分别为0.73和0.69)(补充图S8C)。在区分肝硬化和HCC时,AFP单独和联合miRNA组1的效果较差(验证组,AUC分别为0.57和0.49)。然而,带有AFP的miRNA集合3获得了较高的性能(训练集,AUC = 1.00;验证集,AUC = 0.90)(图1L,补充图S8D,补充表S6)。结合AFP的miRNA集合3在早期HCC的检测和与肝硬化的区分方面始终优于单独使用AFP,突出了其作为HCC有效诊断工具的潜力(图1M,补充图S8E,补充表S7)。为了减轻过拟合,将合成少数过采样技术(SMOTE)应用于AFP模型的miRNA集3。为了将HCC与对照组区分开来,AUC值分别为0.97(训练)和0.96(验证);对于肝硬化和HCC, AUC值分别为0.97(训练)和0.93(验证)。这些结果与原始数据集的结果相匹配,证明了模型的鲁棒性(图1N,补充表S8)。使用包括下一代测序和AFP水平(GSE83977)在内的公开数据,将miRNA集3与先前报道的2个小组[9,10]进行比较。当结合AFP水平时,我们的miRNA组3与3 miRNA组(Raw的AUC = 0.83, SMOTE的AUC = 0.82)和8 miRNA组(Raw的AUC = 0.87, SMOTE的AUC = 0.77)相比,获得了更高的诊断准确性(Raw数据的AUC = 1.00, SMOTE的AUC = 0.97)。基于人工智能的多标记模型将8个外显子mirna与AFP结合在一起,无论在原始样本量还是扩增样本量上,都显示出最高的HCC诊断准确性(图10)。在基于逻辑回归的诊断模型中也观察到这种优越的性能,其中miRNA集3与AFP的组合获得了很高的诊断准确性(补充图S9)。总之,使用肝脏疾病小鼠模型和人类血清样本进行了全面的生物标志物发现过程。在诊断标记选择过程中使用小鼠模型确定了在仅分析人类样本时可能被排除的有价值的标记。基于人工智能的多标记模型结合了8种外显mirna和AFP水平,显示出很高的诊断性能,可以有效区分正常和早期HCC,以及肝硬化和HCC。先前的研究已经研究了基于血液的HCC诊断,强调了液体活检技术的前景[11,12]。在此基础上,我们基于人工智能的诊断模型结合exo-miRNA特征,显示出作为一种非侵入性检测工具的潜力,具有很高的准确性,可以普遍应用于各种类型的HCC。然而,为了将我们研究的这些优势转化为临床应用,需要通过进一步的研究来解决几个挑战。具体而言,(1)简化外泌体提取过程,(2)通过纳入来自更广泛人群的不同临床样本来提高人工智能诊断的准确性,以及(3)进行大规模、多中心验证研究,以增强基于人工智能的外显子mirna诊断模型在不同人群中的普遍性。黄振成、李sugi、权基允、金庆华、郑会彬、赵贤洙等人进行了实验并收集了数据。李sugi, Kim Dae-Soo, Han Tae-Su, Kwon Kiyoon分析了生物信息学数据。 郑恩善、卢允娜、孙太祥、李汉娜、李武承、吴景振、李惠媛、李柳林、朴秀英、德元荣、潘铉承等人对数据进行了分析和解释。韩泰洙、金大洙、许根、金章成、孙美英等人负责了原稿的修改工作。Han Tae-Su, Kim Dae-Soo, Keun Hur, Jang-Seong Kim和Mi-Young Son确认了所有原始数据的真实性。所有作者都阅读并批准了最终稿件。本研究由韩国国家研究基金会(NRF)资助,由科学,信息通信技术和未来规划部(NRF- 2020r1c1c1007431, NRF- 2022r1a2c1003118, RS-2024-00341766, RS-2025-00514590和NRF- 2021r1a5a2021614),韩国再生医学基金(KFRM)资助,由韩国政府(科学和信息通信技术部,卫生福利部,21A0404L1)和KRIBB研究倡议计划资助。作者声明没有利益冲突。本研究得到了大韩民国保健福利部公共机构审查委员会(P01-202105-31-011)和庆北国立大学医院伦理委员会(#KNUH-2014-04-056-001)的批准。所有患者在采集样本前均提供书面知情同意书。动物实验方案经韩国生命科学技术研究院动物实验委员会批准(批准号:No. 5)。kribb -原子能委员会- 19155)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cancer Communications
Cancer Communications Biochemistry, Genetics and Molecular Biology-Cancer Research
CiteScore
25.50
自引率
4.30%
发文量
153
审稿时长
4 weeks
期刊介绍: Cancer Communications is an open access, peer-reviewed online journal that encompasses basic, clinical, and translational cancer research. The journal welcomes submissions concerning clinical trials, epidemiology, molecular and cellular biology, and genetics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信