{"title":"虚拟现实中的Tacotron模型与CNN用于癌症诊断与医患沟通","authors":"Sha Jin, Jiayi Li","doi":"10.1109/AINIT54228.2021.00093","DOIUrl":null,"url":null,"abstract":"Virtual reality is widely used in various fields, such as military, industrial and medical fields. CNN is prevalent when solving problems about image classification. NLP is the most useful model to realize speech synthesis. However, image classification is seldom combined with speech synthesis to be used in a realistic scene. In order to tackle this issue, this paper proposed a system, which combines image classification with speech synthesis organically. There are three steps used to build this system in this paper. First, this paper devises a model to classify whether the patients have skin cancer. It designs a CNN model to deal with the classification of images of skin, diagnosing whether the patient suffers from Melanoma. Second, the Tacotron model is included in this paper to implement speech synthesis, telling the diagnostic results obtained from image classification to the patients. Finally, a virtual reality environment is built to display a scene when a patient entering the hospital and then being diagnosed and getting treatment.","PeriodicalId":326400,"journal":{"name":"2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Tacotron Model and CNN in Virtual Reality for Cancer Diagnosis and Communication between Doctors and Patients\",\"authors\":\"Sha Jin, Jiayi Li\",\"doi\":\"10.1109/AINIT54228.2021.00093\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Virtual reality is widely used in various fields, such as military, industrial and medical fields. CNN is prevalent when solving problems about image classification. NLP is the most useful model to realize speech synthesis. However, image classification is seldom combined with speech synthesis to be used in a realistic scene. In order to tackle this issue, this paper proposed a system, which combines image classification with speech synthesis organically. There are three steps used to build this system in this paper. First, this paper devises a model to classify whether the patients have skin cancer. It designs a CNN model to deal with the classification of images of skin, diagnosing whether the patient suffers from Melanoma. Second, the Tacotron model is included in this paper to implement speech synthesis, telling the diagnostic results obtained from image classification to the patients. Finally, a virtual reality environment is built to display a scene when a patient entering the hospital and then being diagnosed and getting treatment.\",\"PeriodicalId\":326400,\"journal\":{\"name\":\"2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AINIT54228.2021.00093\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AINIT54228.2021.00093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Tacotron Model and CNN in Virtual Reality for Cancer Diagnosis and Communication between Doctors and Patients
Virtual reality is widely used in various fields, such as military, industrial and medical fields. CNN is prevalent when solving problems about image classification. NLP is the most useful model to realize speech synthesis. However, image classification is seldom combined with speech synthesis to be used in a realistic scene. In order to tackle this issue, this paper proposed a system, which combines image classification with speech synthesis organically. There are three steps used to build this system in this paper. First, this paper devises a model to classify whether the patients have skin cancer. It designs a CNN model to deal with the classification of images of skin, diagnosing whether the patient suffers from Melanoma. Second, the Tacotron model is included in this paper to implement speech synthesis, telling the diagnostic results obtained from image classification to the patients. Finally, a virtual reality environment is built to display a scene when a patient entering the hospital and then being diagnosed and getting treatment.