Title: Articulatory Data of Audiovisual Records of Speech Connected by Machine Learning
Authors: R. Trencsényi, L. Czap
Venue: 2022 IEEE 2nd Conference on Information Technology and Data Science (CITDS)
Publication date: 2022-05-16
DOI: 10.1109/CITDS54976.2022.9914284
Citations: 0
Abstract
The focus of the present study is the application of neural networks to combining data from dynamic audiovisual sources produced by ultrasound and magnetic resonance imaging, which record image and sound signals during human speech. The targets of the machine learning are tongue contours fitted to the frames of the audiovisual recordings by automatic contour-tracking algorithms.
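The abstract describes training neural networks to predict tongue contours (extracted from ultrasound/MRI frames by automatic contour tracking) from the accompanying speech signal. The sketch below illustrates that kind of regression in a minimal, hypothetical form: a single linear layer trained by gradient descent to map per-frame acoustic feature vectors to flattened contour coordinates. All shapes, the synthetic data, and the model itself are illustrative assumptions, not the authors' actual architecture or dataset.

```python
import numpy as np

# Hypothetical setup (assumed sizes, not from the paper):
#   N_FRAMES   audiovisual frames
#   N_FEATURES acoustic features per frame (e.g. MFCC-like coefficients)
#   N_POINTS   sampled points along the tongue contour -> 2*N_POINTS (x, y) targets
rng = np.random.default_rng(0)
N_FRAMES, N_FEATURES, N_POINTS = 200, 13, 20

# Synthetic stand-in data: features X, contour targets Y generated from a
# hidden linear map plus a little noise, so the problem is learnable.
X = rng.normal(size=(N_FRAMES, N_FEATURES))
W_true = rng.normal(size=(N_FEATURES, 2 * N_POINTS))
Y = X @ W_true + 0.01 * rng.normal(size=(N_FRAMES, 2 * N_POINTS))

# One linear layer trained by full-batch gradient descent on mean squared error.
W = np.zeros((N_FEATURES, 2 * N_POINTS))
lr = 0.01
for _ in range(500):
    grad = (2.0 / N_FRAMES) * X.T @ (X @ W - Y)  # gradient of MSE w.r.t. W
    W -= lr * grad

mse = float(np.mean((X @ W - Y) ** 2))
print(f"final MSE: {mse:.4f}")
```

A real system along these lines would replace the linear layer with a deeper network and the synthetic arrays with aligned acoustic features and tracked contour coordinates, but the training loop has the same shape.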