Title: ASR - VLSP 2021: Semi-supervised Ensemble Model for Vietnamese Automatic Speech Recognition
Authors: Phạm Việt Thành, Le Duc Cuong, Dao Dang Huy, Luu Duc Thanh, Nguyen Duc Tan, Dang Trung Duc Anh, Nguyen Thi Thu Trang
Journal: VNU Journal of Science: Computer Science and Communication Engineering (2022-06-30)
DOI: https://doi.org/10.25073/2588-1086/vnucsce.332
Abstract: Automatic speech recognition (ASR) has made great advances with the arrival of end-to-end architectures. Semi-supervised learning methods, which can exploit unlabeled data, have contributed largely to the success of ASR systems, giving them the ability to surpass human performance. However, most research focuses on developing these techniques for English speech recognition, which raises concerns about their performance in other languages, especially in low-resource scenarios. In this paper, we propose a Vietnamese ASR system for the VLSP 2021 Automatic Speech Recognition Shared Task. The system is based on the Wav2vec 2.0 framework, combined with self-training and several data augmentation techniques. Experimental results show that on the ASR-T1 test set of the shared task, our proposed model achieved a remarkable result, ranking second with a Syllable Error Rate (SyER) of 11.08%.
{"title":"NER - VLSP 2021: A Span-Based Model for Named Entity Recognition Task with Co-teaching+ Training Strategy","authors":"Pham Hoai Phu Thinh, Vu Tran Duy, Do Tran Anh Duc","doi":"10.25073/2588-1086/vnucsce.328","DOIUrl":"https://doi.org/10.25073/2588-1086/vnucsce.328","url":null,"abstract":"Named entities containing other named entities inside are referred to as nested entities, which commonly exist in news articles and other documents. However, most studies in the field of Vietnamese named entity recognition entirely ignore nested entities. In this report, we describe our system at VLSP 2021 evaluation campaign, adopting the technique from dependency parsing to tackle the problem of nested entities. We also apply Coteaching+ technique to enhance the overall performance and propose an ensemble algorithm to combine predictions. Experimental results show that the ensemble method achieves the best F1 score on the test set at VLSP 2021.","PeriodicalId":416488,"journal":{"name":"VNU Journal of Science: Computer Science and Communication Engineering","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121556302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"VLSP 2021 - ASR Challenge for Vietnamese Automatic Speech Recognition","authors":"Van Hai Do","doi":"10.25073/2588-1086/vnucsce.356","DOIUrl":"https://doi.org/10.25073/2588-1086/vnucsce.356","url":null,"abstract":"Recently, Vietnamese speech recognition has been attracted by various research groups in both academics and industry. This paper presents a Vietnamese automatic speech recognition challenge for the eighth annual workshop on Vietnamese Language and Speech Processing (VLSP 2021). There are two sub-tasks in the challenge. The first task is ASR-Task1 focusing on a full pipeline development of the ASR model from scratch with both labeled and unlabeled training data provided by the organizer. The second task is ASR-Task2 focusing on spontaneous speech in different real scenarios e.g., meeting conversation, lecture speech. In the ASR-Task2, participants can use all available data sources to develop their models without any limitations. The quality of the models is evaluated by the Syllable Error Rate (SyER) metric.","PeriodicalId":416488,"journal":{"name":"VNU Journal of Science: Computer Science and Communication Engineering","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128879233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"VLSP 2021 - TTS Challenge: Vietnamese Spontaneous Speech Synthesis","authors":"Nguyen Thi Thu Trang, H. Nguyen","doi":"10.25073/2588-1086/vnucsce.358","DOIUrl":"https://doi.org/10.25073/2588-1086/vnucsce.358","url":null,"abstract":"Text-To-Speech (TTS) was one of nine shared tasks in the eighth annual international VLSP 2021 workshop. All three previous TTS shared tasks were conducted on reading datasets. However, the synthetic voices were not natural enough for spoken dialog systems where the computer must talk to the human in a conversation. Speech datasets recorded in a spontaneous environment help a TTS system to produce more natural voices in speaking style, speaking rate, intonation... Therefore, in this shared task, participants were asked to build a TTS system from a spontaneous speech dataset. This 7.5-hour dataset was collected from a channel of a famous youtuber \"Giang ơi...\"and then pre-processed to build utterances and their corresponding texts. Main challenges at this task this year were: (i) inconsistency in speaking rate, intensity, stress and prosody across the dataset, (ii) background noises or mixed with other voices, and (iii) inaccurate transcripts. A total of 43 teams registered to participate in this shared task, and finally, 8 submissions were evaluated online with perceptual tests. Two types of perceptual tests were conducted: (i) MOS test for naturalness and (ii) SUS (Semantically Unpredictable Sentences) test for intelligibility. The best SUS intelligibility TTS system had a syllable error rate of 15%, while the best MOS score on dialog utterances was 3.98 over 4.54 points on a 5-point MOS scale. The prosody and speaking rate of synthetic voices were similar to the natural one. However, there were still some distorted segments and background noises in most of TTS systems, a half of which had a syllable error rate of at least 30%.","PeriodicalId":416488,"journal":{"name":"VNU Journal of Science: Computer Science and Communication Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130015394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

Title: TTS - VLSP 2021: Development of Smartcall Vietnamese Text-to-Speech
Authors: Nguyen Quoc Bao, Le Ba Hoai, N. Hoc, Dam Ba Quyen, Nguyen Thu Phuong
Journal: VNU Journal of Science: Computer Science and Communication Engineering (2022-06-30)
DOI: https://doi.org/10.25073/2588-1086/vnucsce.348
Abstract: Recent advances in deep learning facilitate the development of end-to-end Vietnamese text-to-speech (TTS) systems with high intelligibility and naturalness, given a clean training corpus. With a rich source of audio recordings on the Internet, TTS has excellent potential for growth if it can take advantage of this data. However, the quality of such data is often not sufficient for training TTS systems, e.g., because of noisy audio. In this paper, we propose an approach that preprocesses noisy found data from the Internet and trains a high-quality TTS model on the processed data. The VLSP-provided training data was thoroughly preprocessed using 1) voice activity detection, 2) automatic speech recognition-based prosodic punctuation insertion, and 3) Spleeter, a source-separation tool, to separate voice from background music. Moreover, we utilize a state-of-the-art TTS system based on a Conditional Variational Autoencoder with Adversarial Learning. Our experiments show that the proposed TTS system trained on the preprocessed data achieves good results on the provided noisy dataset.
{"title":"ASR - VLSP 2021: An Efficient Transformer-based Approach for Vietnamese ASR Task","authors":"Toan Truong Tien","doi":"10.25073/2588-1086/vnucsce.325","DOIUrl":"https://doi.org/10.25073/2588-1086/vnucsce.325","url":null,"abstract":"Various techniques have been applied to enhance automatic speech recognition during the last few years. Reaching auspicious performance in natural language processing makes Transformer architecture becoming the de facto standard in numerous domains. This paper first presents our effort to collect a 3000-hour Vietnamese speech corpus. After that, we introduce the system used for VLSP 2021 ASR task 2, which is based on the Transformer. Our simple method achieves a favorable syllable error rate of 6.72% and gets second place on the private test. Experimental results indicate that the proposed approach dominates traditional methods with lower syllable error rates on general-domain evaluation sets. Finally, we show that applying Vietnamese word segmentation on the label does not improve the efficiency of the ASR system.","PeriodicalId":416488,"journal":{"name":"VNU Journal of Science: Computer Science and Communication Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116065865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

Title: ASR - VLSP 2021: Automatic Speech Recognition with Blank Label Re-weighting
Authors: T. Thang, Dang Dinh Son, Le Dang Linh, Dang Xuan Vuong, Duong Quang Tien
Journal: VNU Journal of Science: Computer Science and Communication Engineering (2022-06-30)
DOI: https://doi.org/10.25073/2588-1086/vnucsce.321
Abstract: End-to-end models have significant potential in most languages and have recently proved robust in ASR tasks. Among the many architectures proposed, the Recurrent Neural Network Transducer (RNN-T) has shown remarkable success. However, with background noise or reverberation in spontaneous speech, this architecture generally suffers from high deletion errors. For this reason, we propose a blank label re-weighting technique to improve the state-of-the-art Conformer transducer model. Our system also adopts the Stochastic Weight Averaging approach, which stabilizes the training process. Our work achieved first rank with a word error rate of 4.17% in Task 2 of the VLSP 2021 competition.

Title: SV - VLSP2021: The Smartcall - ITS’s Systems
Authors: Hung Van Dinh, Tuan Van Mai, Quyen B. Dam, Bao Quoc Nguyen
Journal: VNU Journal of Science: Computer Science and Communication Engineering (2022-06-30)
DOI: https://doi.org/10.25073/2588-1086/vnucsce.339
Abstract: This paper presents the Smartcall - ITS systems submitted to the Speaker Verification (SV) task of the Vietnamese Language and Speech Processing (VLSP) 2021 workshop. The challenge consists of two tasks, focusing on developing SV models with limited data and on testing the robustness of SV systems. In both tasks, we used various pre-trained speaker embedding models with different architectures: TDNN and ResNet34. After a specific fine-tuning strategy with data from the organiser, our systems achieved first rank in both tasks, with Equal Error Rates of 1.755% and 1.95%, respectively. In this paper, we describe the systems we developed for both tasks of the VLSP 2021 Speaker Verification shared task.

Title: VLSP 2021 - SV challenge: Vietnamese Speaker Verification in Noisy Environments
Authors: Vi Thanh Dat, Phạm Việt Thành, Nguyen Thi Thu Trang
Journal: VNU Journal of Science: Computer Science and Communication Engineering (2022-06-30)
DOI: https://doi.org/10.25073/2588-1086/vnucsce.333
Abstract: VLSP 2021 was the eighth annual international workshop, whose evaluation campaign was organized at the University of Information Technology, Vietnam National University, Ho Chi Minh City (UIT-VNU-HCM). This was the first time we organized the Speaker Verification shared task, with two subtasks, SV-T1 and SV-T2. SV-T1 focuses on developing SV models with limited data, and SV-T2 focuses on testing the capability and robustness of SV systems. To boost the development of robust models, we collected, processed, and published a speaker dataset recorded in noisy environments, containing 50 hours of speech and more than 1,300 speaker identities. A total of 39 teams registered for this shared task, 15 teams received the dataset, and 7 teams submitted final solutions. The best solution leveraged English pre-trained models and achieved Equal Error Rates of 1.755% and 1.950% for SV-T1 and SV-T2, respectively.

Title: Ultra-High-Throughput Multi-Core AES Encryption Hardware Architecture
Authors: Pham-Khoi Dong, Hung K. Nguyen, F. Hussin, Xuan-Tu Tran
Journal: VNU Journal of Science: Computer Science and Communication Engineering (2021-11-11)
DOI: https://doi.org/10.25073/2588-1086/vnucsce.290
Abstract: Security in high-speed data transfer between devices is always a major challenge. Meanwhile, new data transfer standards such as IEEE P802.3bs 2017 specify data rates of up to 400 Gbps, so encryption needs high throughput to match the transfer rate and low latency to ensure quality of service. In this paper, we propose a multi-core AES encryption hardware architecture that achieves ultra-high-throughput encryption. To reduce area cost and power consumption, the cores share the same KeyExpansion blocks. A fully parallel, outer-round pipeline technique is also applied to the proposed architecture to achieve low-latency encryption. The design has been modelled at the register-transfer level (RTL) in VHDL and synthesized in a 45 nm CMOS technology using Synopsys Design Compiler. With 10 fully parallel cores and an outer-round pipeline, the implementation results show that our architecture achieves a throughput of 1 Tbps at a maximum operating frequency of 800 MHz. These results meet the speed requirements of future communication standards. In addition, our design achieves a high power efficiency of 2377 Gbps/W and an area efficiency of 833 Gbps/mm2, which are 2.6x and 4.5x higher, respectively, than those of the highest-throughput single-core AES design.