Jie Gao, Tao Chen, Yong Xu, Yijie Wu, Kunhong Liu, Weihong Qiu, Weimin Ye
{"title":"Accurate fatty liver disease diagnosis with a multi-source feature fusion model on the segmented tongue image dataset","authors":"Jie Gao, Tao Chen, Yong Xu, Yijie Wu, Kunhong Liu, Weihong Qiu, Weimin Ye","doi":"10.1016/j.jare.2025.10.003","DOIUrl":null,"url":null,"abstract":"<h3>Introduction</h3>More than 100 million individuals in rural areas of China are suffered from Fatty Liver Disease (FLD). However, health clinics in remote regions often lack the necessary professional expertise and expensive ultrasound equipment for regular liver disease screening. Delayed treatment frequently leads to liver cirrhosis and cancer, imposing substantial economic burden on both public health systems and affected families.<h3>Objectives</h3>Traditional Chinese Medicine emphasizes the strong association between tongue characteristics and liver health. Leveraging machine learning to model the relationship between tongue images and FLD can enable rapid, non-invasive, large-scale screening in medically underserved areas. However, existing studies in this domain often rely on small-scale private datasets, which can result in unverifiable model performance. Moreover, most studies have employed generic convolutional neural networks for feature extraction, causing a lack of interpretability. The goal of our research is to address above-mentioned questions.<h3>Methods</h3>In this study, we first introduced a Multi-source Feature Fusion-based Tongue Diagnosis Framework for FLD diagnosis (MFF-TDF). In addition, we developed and released a standardized tongue image dataset with physiological indicators and FLD annotations, comprising 5,717 samples, which to our knowledge is the largest public dataset in this domain. Finally, we evaluated the effectiveness of the proposed method through extensive experiments and enhanced model interpretability using shapley additive explanations and counterfactual analysis.<h3>Results</h3>When conducting fusion modeling with tongue images and some basic physiological indicators (such as sex, age, height, etc.), FLD’s prediction performance in the population reached F1-score 0.797, Recall 0.847, and AUC 0.924. This performance significantly exceeds that of the state-of-the-art methods published in this domain.<h3>Conclusion</h3>This study developed an automated and explainable method for tongue diagnosis that facilitated the low-cost, speedy screening of FLD in large-scale populations, and contributed the largest public dataset to support future modeling research in this field.","PeriodicalId":14952,"journal":{"name":"Journal of Advanced Research","volume":"26 1","pages":""},"PeriodicalIF":13.0000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advanced Research","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1016/j.jare.2025.10.003","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction
More than 100 million individuals in rural areas of China are suffered from Fatty Liver Disease (FLD). However, health clinics in remote regions often lack the necessary professional expertise and expensive ultrasound equipment for regular liver disease screening. Delayed treatment frequently leads to liver cirrhosis and cancer, imposing substantial economic burden on both public health systems and affected families.
Objectives
Traditional Chinese Medicine emphasizes the strong association between tongue characteristics and liver health. Leveraging machine learning to model the relationship between tongue images and FLD can enable rapid, non-invasive, large-scale screening in medically underserved areas. However, existing studies in this domain often rely on small-scale private datasets, which can result in unverifiable model performance. Moreover, most studies have employed generic convolutional neural networks for feature extraction, causing a lack of interpretability. The goal of our research is to address above-mentioned questions.
Methods
In this study, we first introduced a Multi-source Feature Fusion-based Tongue Diagnosis Framework for FLD diagnosis (MFF-TDF). In addition, we developed and released a standardized tongue image dataset with physiological indicators and FLD annotations, comprising 5,717 samples, which to our knowledge is the largest public dataset in this domain. Finally, we evaluated the effectiveness of the proposed method through extensive experiments and enhanced model interpretability using shapley additive explanations and counterfactual analysis.
Results
When conducting fusion modeling with tongue images and some basic physiological indicators (such as sex, age, height, etc.), FLD’s prediction performance in the population reached F1-score 0.797, Recall 0.847, and AUC 0.924. This performance significantly exceeds that of the state-of-the-art methods published in this domain.
Conclusion
This study developed an automated and explainable method for tongue diagnosis that facilitated the low-cost, speedy screening of FLD in large-scale populations, and contributed the largest public dataset to support future modeling research in this field.
期刊介绍:
Journal of Advanced Research (J. Adv. Res.) is an applied/natural sciences, peer-reviewed journal that focuses on interdisciplinary research. The journal aims to contribute to applied research and knowledge worldwide through the publication of original and high-quality research articles in the fields of Medicine, Pharmaceutical Sciences, Dentistry, Physical Therapy, Veterinary Medicine, and Basic and Biological Sciences.
The following abstracting and indexing services cover the Journal of Advanced Research: PubMed/Medline, Essential Science Indicators, Web of Science, Scopus, PubMed Central, PubMed, Science Citation Index Expanded, Directory of Open Access Journals (DOAJ), and INSPEC.