虚拟现实中的Tacotron模型与CNN用于癌症诊断与医患沟通

2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT) Pub Date : 2021-10-01 DOI:10.1109/AINIT54228.2021.00093

Sha Jin, Jiayi Li

{"title":"虚拟现实中的Tacotron模型与CNN用于癌症诊断与医患沟通","authors":"Sha Jin, Jiayi Li","doi":"10.1109/AINIT54228.2021.00093","DOIUrl":null,"url":null,"abstract":"Virtual reality is widely used in various fields, such as military, industrial and medical fields. CNN is prevalent when solving problems about image classification. NLP is the most useful model to realize speech synthesis. However, image classification is seldom combined with speech synthesis to be used in a realistic scene. In order to tackle this issue, this paper proposed a system, which combines image classification with speech synthesis organically. There are three steps used to build this system in this paper. First, this paper devises a model to classify whether the patients have skin cancer. It designs a CNN model to deal with the classification of images of skin, diagnosing whether the patient suffers from Melanoma. Second, the Tacotron model is included in this paper to implement speech synthesis, telling the diagnostic results obtained from image classification to the patients. Finally, a virtual reality environment is built to display a scene when a patient entering the hospital and then being diagnosed and getting treatment.","PeriodicalId":326400,"journal":{"name":"2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Tacotron Model and CNN in Virtual Reality for Cancer Diagnosis and Communication between Doctors and Patients\",\"authors\":\"Sha Jin, Jiayi Li\",\"doi\":\"10.1109/AINIT54228.2021.00093\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Virtual reality is widely used in various fields, such as military, industrial and medical fields. CNN is prevalent when solving problems about image classification. NLP is the most useful model to realize speech synthesis. However, image classification is seldom combined with speech synthesis to be used in a realistic scene. In order to tackle this issue, this paper proposed a system, which combines image classification with speech synthesis organically. There are three steps used to build this system in this paper. First, this paper devises a model to classify whether the patients have skin cancer. It designs a CNN model to deal with the classification of images of skin, diagnosing whether the patient suffers from Melanoma. Second, the Tacotron model is included in this paper to implement speech synthesis, telling the diagnostic results obtained from image classification to the patients. Finally, a virtual reality environment is built to display a scene when a patient entering the hospital and then being diagnosed and getting treatment.\",\"PeriodicalId\":326400,\"journal\":{\"name\":\"2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AINIT54228.2021.00093\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AINIT54228.2021.00093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

虚拟现实技术被广泛应用于军事、工业、医疗等各个领域。CNN在解决图像分类问题时非常流行。自然语言处理是实现语音合成最有用的模型。然而，将图像分类与语音合成相结合并应用于真实场景的研究却很少。为了解决这一问题，本文提出了一种将图像分类与语音合成有机结合的系统。本文通过三个步骤来构建这个系统。首先，本文设计了一个模型来对患者是否患有皮肤癌进行分类。它设计了一个CNN模型来处理皮肤图像的分类，诊断患者是否患有黑色素瘤。其次，本文采用Tacotron模型实现语音合成，将图像分类得到的诊断结果告知患者。最后，构建虚拟现实环境，展示患者进入医院，进行诊断和治疗的场景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Tacotron Model and CNN in Virtual Reality for Cancer Diagnosis and Communication between Doctors and Patients

Virtual reality is widely used in various fields, such as military, industrial and medical fields. CNN is prevalent when solving problems about image classification. NLP is the most useful model to realize speech synthesis. However, image classification is seldom combined with speech synthesis to be used in a realistic scene. In order to tackle this issue, this paper proposed a system, which combines image classification with speech synthesis organically. There are three steps used to build this system in this paper. First, this paper devises a model to classify whether the patients have skin cancer. It designs a CNN model to deal with the classification of images of skin, diagnosing whether the patient suffers from Melanoma. Second, the Tacotron model is included in this paper to implement speech synthesis, telling the diagnostic results obtained from image classification to the patients. Finally, a virtual reality environment is built to display a scene when a patient entering the hospital and then being diagnosed and getting treatment.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)

自引率

0.00%

发文量