{"title":"QA-TSN:快速准确舌苔分割网","authors":"Guangze Jia, Zhenchao Cui, Qingsong Fei","doi":"10.1016/j.knosys.2024.112648","DOIUrl":null,"url":null,"abstract":"<div><div>Tongue segmentation is an essential part for computer-aided tongue diagnosis. Since of similar color and texture between tongue body and non-tongue body, such as lips and face, existing methods produce the lack of accuracy and completeness for tongue segmentation results. Moreover, small samples in tongue datasets lead under-fitting on CNN-based methods which always produce poor segmentation. To solve these problems, we designed the quick accurate tongue segmentation net (QA-TSN) to segment tongue body. To alleviate small sample problem, in the proposed method, a tongue-style transfer generation net(T-STGN) was propose to synthesize tongue images. In T-STGN, a novel encoder–decoder structure with two encoder with a global rendering block was used to refine global characteristics of synthetic tongue images. For real-time tongue segmentation, quicker tongue segmentation net (QTSN) was proposed in QA-TSN. In QTSN, we used an encoder–decoder structure with modified partial convolution (MPConv) to expedite the computation for real-time segmentation. To smooth the segments of tongue body, a novel loss function of tongue segmentation loss (TSL) was proposed. In TSL, tongue edge loss (TEL) was used to smooth the boundary of segmentation of tongue body and tongue area loss (TAL) was proposed to improve the fragmentation of segmentation results. Experiments conducted on tongue datasets achieved an IoU of 98.0307 and a Dice score of 99.0738, with a frame rate of 75.35, outperforming all other methods involved in the experiment. These results demonstrate the effectiveness of the proposed QA-TSN.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"307 ","pages":"Article 112648"},"PeriodicalIF":7.2000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"QA-TSN: QuickAccurate Tongue Segmentation Net\",\"authors\":\"Guangze Jia, Zhenchao Cui, Qingsong Fei\",\"doi\":\"10.1016/j.knosys.2024.112648\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Tongue segmentation is an essential part for computer-aided tongue diagnosis. Since of similar color and texture between tongue body and non-tongue body, such as lips and face, existing methods produce the lack of accuracy and completeness for tongue segmentation results. Moreover, small samples in tongue datasets lead under-fitting on CNN-based methods which always produce poor segmentation. To solve these problems, we designed the quick accurate tongue segmentation net (QA-TSN) to segment tongue body. To alleviate small sample problem, in the proposed method, a tongue-style transfer generation net(T-STGN) was propose to synthesize tongue images. In T-STGN, a novel encoder–decoder structure with two encoder with a global rendering block was used to refine global characteristics of synthetic tongue images. For real-time tongue segmentation, quicker tongue segmentation net (QTSN) was proposed in QA-TSN. In QTSN, we used an encoder–decoder structure with modified partial convolution (MPConv) to expedite the computation for real-time segmentation. To smooth the segments of tongue body, a novel loss function of tongue segmentation loss (TSL) was proposed. In TSL, tongue edge loss (TEL) was used to smooth the boundary of segmentation of tongue body and tongue area loss (TAL) was proposed to improve the fragmentation of segmentation results. Experiments conducted on tongue datasets achieved an IoU of 98.0307 and a Dice score of 99.0738, with a frame rate of 75.35, outperforming all other methods involved in the experiment. These results demonstrate the effectiveness of the proposed QA-TSN.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"307 \",\"pages\":\"Article 112648\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2024-11-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705124012826\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124012826","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Tongue segmentation is an essential part for computer-aided tongue diagnosis. Since of similar color and texture between tongue body and non-tongue body, such as lips and face, existing methods produce the lack of accuracy and completeness for tongue segmentation results. Moreover, small samples in tongue datasets lead under-fitting on CNN-based methods which always produce poor segmentation. To solve these problems, we designed the quick accurate tongue segmentation net (QA-TSN) to segment tongue body. To alleviate small sample problem, in the proposed method, a tongue-style transfer generation net(T-STGN) was propose to synthesize tongue images. In T-STGN, a novel encoder–decoder structure with two encoder with a global rendering block was used to refine global characteristics of synthetic tongue images. For real-time tongue segmentation, quicker tongue segmentation net (QTSN) was proposed in QA-TSN. In QTSN, we used an encoder–decoder structure with modified partial convolution (MPConv) to expedite the computation for real-time segmentation. To smooth the segments of tongue body, a novel loss function of tongue segmentation loss (TSL) was proposed. In TSL, tongue edge loss (TEL) was used to smooth the boundary of segmentation of tongue body and tongue area loss (TAL) was proposed to improve the fragmentation of segmentation results. Experiments conducted on tongue datasets achieved an IoU of 98.0307 and a Dice score of 99.0738, with a frame rate of 75.35, outperforming all other methods involved in the experiment. These results demonstrate the effectiveness of the proposed QA-TSN.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.