[Accuracy of multi-task network based on vision Transformer in the three-dimensional upper airway analysis].

Q4 Medicine

中华口腔医学杂志 Pub Date : 2024-09-09 DOI:10.3760/cma.j.cn112144-20240514-00205

S H Jin, H J Han, F Chen, X Y Guan, F Hua, H He

{"title":"[Accuracy of multi-task network based on vision Transformer in the three-dimensional upper airway analysis].","authors":"S H Jin, H J Han, F Chen, X Y Guan, F Hua, H He","doi":"10.3760/cma.j.cn112144-20240514-00205","DOIUrl":null,"url":null,"abstract":"Objective: To explore the accuracy of a multi-task model based on vision Transformer for analyzing the three-dimensional (3D) upper airway and its subregions, and to evaluate its clinical applicability. Methods: According to the inclusion and exclusion criteria, cone-beam CT (CBCT) data of 10 patients [4 males and 6 females, (20.8±2.7) years] who had their first visit to the Department of Orthodontics in the Hospital of Stomatology, Wuhan University from January 2012 to January 2020 were retrospectively selected. The 3D slicer software was used to segment the upper airway and pharyngeal airway and measure their volumes as the gold standard. The Dolphin 3D software was used to segment the pharyngeal airway and its subregions and measure their volumes as the gold standard. A multi-task model based on vision Transformer developed by the research team for automatic segmentation and volume measurement of the upper airway and its subregions. All the measurements were conducted by the same attending physician. The Bland-Altman analysis and intraclass correlation coefficient (ICC) were used to evaluate the consistency between the multi-task network and the gold standard in the upper airway segmentation and volume measurements, and the paired t test was used to compare the differences between the multi-tasking model and the gold standard. Results: The mean volume deviation of the upper airway segmented by multi-task model and 3D Slicer was -979.6 mm3, and the ICC was 0.97. The mean volume deviation of the pharyngeal airway, nasopharynx, velopharynx, glossopharynx and hypopharynx segmented by multi-task network and Dolphin 3D were 2 069.5, -950.1, -823.6, -813.9 and 4 003.4 mm3, respectively. In addition, ICC in pharyngeal airway, nasopharynx, velopharynx, glossopharynx and hypopharynx were 0.97, 0.94, 0.96, 0.96 and 0.69, respectively. Conclusions: The multi-task model based on vision Transformer produced different errors in the segmentation of 3D upper airway and its subregions. The segmentation of the nasopharynx, velopharynx and glossopharynx was in good agreement with the gold standard, while the segmentation of hypopharynx was poor, suggesting that the robustness and generalization of this model should be further enhanced.","PeriodicalId":23965,"journal":{"name":"中华口腔医学杂志","volume":"59 9","pages":"911-918"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华口腔医学杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/cma.j.cn112144-20240514-00205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: To explore the accuracy of a multi-task model based on vision Transformer for analyzing the three-dimensional (3D) upper airway and its subregions, and to evaluate its clinical applicability. Methods: According to the inclusion and exclusion criteria, cone-beam CT (CBCT) data of 10 patients [4 males and 6 females, (20.8±2.7) years] who had their first visit to the Department of Orthodontics in the Hospital of Stomatology, Wuhan University from January 2012 to January 2020 were retrospectively selected. The 3D slicer software was used to segment the upper airway and pharyngeal airway and measure their volumes as the gold standard. The Dolphin 3D software was used to segment the pharyngeal airway and its subregions and measure their volumes as the gold standard. A multi-task model based on vision Transformer developed by the research team for automatic segmentation and volume measurement of the upper airway and its subregions. All the measurements were conducted by the same attending physician. The Bland-Altman analysis and intraclass correlation coefficient (ICC) were used to evaluate the consistency between the multi-task network and the gold standard in the upper airway segmentation and volume measurements, and the paired t test was used to compare the differences between the multi-tasking model and the gold standard. Results: The mean volume deviation of the upper airway segmented by multi-task model and 3D Slicer was -979.6 mm³, and the ICC was 0.97. The mean volume deviation of the pharyngeal airway, nasopharynx, velopharynx, glossopharynx and hypopharynx segmented by multi-task network and Dolphin 3D were 2 069.5, -950.1, -823.6, -813.9 and 4 003.4 mm³, respectively. In addition, ICC in pharyngeal airway, nasopharynx, velopharynx, glossopharynx and hypopharynx were 0.97, 0.94, 0.96, 0.96 and 0.69, respectively. Conclusions: The multi-task model based on vision Transformer produced different errors in the segmentation of 3D upper airway and its subregions. The segmentation of the nasopharynx, velopharynx and glossopharynx was in good agreement with the gold standard, while the segmentation of hypopharynx was poor, suggesting that the robustness and generalization of this model should be further enhanced.

查看原文本刊更多论文

[基于视觉转换器的多任务网络在三维上气道分析中的准确性]。

目的探索基于视觉转换器的多任务模型分析三维（3D）上气道及其亚区域的准确性，并评估其临床适用性。研究方法根据纳入和排除标准，回顾性选取2012年1月至2020年1月在武汉大学口腔医院正畸科首次就诊的10例患者[男4例，女6例，（20.8±2.7）岁]的锥束CT（CBCT）数据。使用三维切片软件分割上气道和咽气道，并测量其体积作为金标准。Dolphin 3D 软件用于分割咽气道及其亚区域，并测量其体积作为金标准。研究小组开发了一个基于视觉 Transformer 的多任务模型，用于自动分割和测量上气道及其亚区的容积。所有测量均由同一位主治医生进行。采用布兰德-阿尔特曼分析和类内相关系数（ICC）评估多任务网络与金标准在上气道分割和容积测量方面的一致性，采用配对 t 检验比较多任务模型与金标准之间的差异。结果显示多任务模型和三维切片机分割的上气道平均体积偏差为-979.6 mm3，ICC为0.97。多任务网络和 Dolphin 3D 对咽气道、鼻咽、会厌、舌咽和下咽的平均体积偏差分别为 2 069.5、-950.1、-823.6、-813.9 和 4 003.4 mm3。此外，咽气道、鼻咽、会厌、舌咽和下咽的 ICC 分别为 0.97、0.94、0.96、0.96 和 0.69。结论基于视觉转换器的多任务模型在分割三维上气道及其亚区域时产生了不同的误差。鼻咽、velopharynx和舌咽的分割与金标准吻合较好，而下咽的分割较差，表明该模型的鲁棒性和通用性有待进一步提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

中华口腔医学杂志 Medicine-Medicine (all)

CiteScore

0.90

自引率

0.00%

发文量

9692

期刊介绍： Founded in August 1953, Chinese Journal of Stomatology is a monthly academic journal of stomatology published publicly at home and abroad, sponsored by the Chinese Medical Association and co-sponsored by the Chinese Stomatology Association. It mainly reports the leading scientific research results and clinical diagnosis and treatment experience in the field of oral medicine, as well as the basic theoretical research that has a guiding role in oral clinical practice and is closely combined with oral clinical practice. Chinese Journal of Over the years, Stomatology has been published in Medline, Scopus database, Toxicology Abstracts Database, Chemical Abstracts Database, American Cancer database, Russian Abstracts database, China Core Journal of Science and Technology, Peking University Core Journal, CSCD and other more than 20 important journals at home and abroad Physical medicine database and retrieval system included.