Mei Zhao, Hengyu Zhou, Jing Wang, Yongyue Liu, Xiaoqing Zhang
{"title":"A new method for identification of traditional Chinese medicine constitution based on tongue features with machine learning.","authors":"Mei Zhao, Hengyu Zhou, Jing Wang, Yongyue Liu, Xiaoqing Zhang","doi":"10.3233/THC-240128","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The theory of Chinese medicine (TCM) constitution contributes to the optimisation of individualised healthcare programmes. However, at present, TCM constitution identification mainly relies on inefficient questionnaires with subjective bias. Efficient and accurate TCM constitution identification can play an important role in individualised medicine and healthcare.</p><p><strong>Objective: </strong>Building an efficient model for identifying traditional Chinese medicine constitutions using objective tongue features and machine learning techniques.</p><p><strong>Methods: </strong>The DS01-A device was applied to collect tongue images and extract features. We trained and evaluated five machine learning models: Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), LightGBM (LGBM), and CatBoost (CB). Among these, we selected the model with the best performance as the base classifier for constructing our heterogeneous ensemble learning model. Using various performance metrics, including classification accuracy, precision, recall, F1 score, and area under curve (AUC), to comprehensively evaluate model performance.</p><p><strong>Results: </strong>A total of 1149 tongue images were obtained and 45 features were extracted, forming dataset 1. RF, LGBM, and CB were selected as the base learners for the RLC-Stacking. On dataset 1, RLC-Stacking1 achieved an accuracy of 0.8122, outperforming individual classifiers. After feature selection, the classification accuracy of RLC-Stacking2 improved to 0.8287, an improvement of 0.00165 compared to RLC-Stacking1. RLC-Stacking2 achieved an accuracy exceeding 0.85 for identifying each TCM constitution type, indicating excellent identification performance.</p><p><strong>Conclusion: </strong>The study provides a reliable method for the accurate and rapid identification of TCM constitutions and can assist clinicians in tailoring individualized medical treatments based on personal constitution types and guide daily health care. The information extracted from tongue images serves as an effective marker for objective TCM constitution identification.</p>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3233/THC-240128","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The theory of Chinese medicine (TCM) constitution contributes to the optimisation of individualised healthcare programmes. However, at present, TCM constitution identification mainly relies on inefficient questionnaires with subjective bias. Efficient and accurate TCM constitution identification can play an important role in individualised medicine and healthcare.
Objective: Building an efficient model for identifying traditional Chinese medicine constitutions using objective tongue features and machine learning techniques.
Methods: The DS01-A device was applied to collect tongue images and extract features. We trained and evaluated five machine learning models: Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), LightGBM (LGBM), and CatBoost (CB). Among these, we selected the model with the best performance as the base classifier for constructing our heterogeneous ensemble learning model. Using various performance metrics, including classification accuracy, precision, recall, F1 score, and area under curve (AUC), to comprehensively evaluate model performance.
Results: A total of 1149 tongue images were obtained and 45 features were extracted, forming dataset 1. RF, LGBM, and CB were selected as the base learners for the RLC-Stacking. On dataset 1, RLC-Stacking1 achieved an accuracy of 0.8122, outperforming individual classifiers. After feature selection, the classification accuracy of RLC-Stacking2 improved to 0.8287, an improvement of 0.00165 compared to RLC-Stacking1. RLC-Stacking2 achieved an accuracy exceeding 0.85 for identifying each TCM constitution type, indicating excellent identification performance.
Conclusion: The study provides a reliable method for the accurate and rapid identification of TCM constitutions and can assist clinicians in tailoring individualized medical treatments based on personal constitution types and guide daily health care. The information extracted from tongue images serves as an effective marker for objective TCM constitution identification.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.