EmoBone: A Multinational Audio Dataset of Emotional Bone Conducted Speech

Impact factor: 1.0; Zone 4 (Engineering Technology); JCR Q4 (Engineering, Electrical & Electronic)
Md. Sarwar Hosain, Yosuke Sugiura, M. Shahidur Rahman, Tetsuya Shimamura
IEEJ Transactions on Electrical and Electronic Engineering, vol. 19, no. 9, pp. 1492-1506. Published 2024-05-29. DOI: 10.1002/tee.24110
https://onlinelibrary.wiley.com/doi/10.1002/tee.24110
Citations: 0

Abstract

This paper introduces EmoBone, a comprehensive audio-only emotional bone-conducted speech dataset featuring speakers from various countries. The dataset comprises speeches from 28 individuals representing 10 different nations, with each participant delivering 10 sentences designed to evoke distinct emotions. In addition to an air-conducted microphone, the recordings utilized bone conduction technology, transmitting sound directly to the speakers' inner ears, ensuring high-quality emotional speech recordings. To assess the validity of the dataset, 80 university students from Bangladesh listened to the recordings and successfully identified the expressed emotions with an accuracy exceeding 76%. Statistical methods were also employed to evaluate the reliability of the dataset, revealing a high level of agreement among raters. EmoBone, with a cumulative duration surpassing 19 h and 15 680 unique utterances, stands as the most extensive emotional speech dataset available. This makes it a valuable tool for studying how emotional speech varies across cultures. Furthermore, due to its utilization of bone conduction technology, EmoBone facilitates the study of acoustic features in emotional speech from diverse dimensions. The data that supports the findings of this study is available upon reasonable request. © 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.

Source journal
IEEJ Transactions on Electrical and Electronic Engineering (Engineering Technology; Engineering: Electrical & Electronic)
CiteScore: 2.70
Self-citation rate: 10.00%
Publication volume: 199
Review time: 4.3 months
About the journal: IEEJ Transactions on Electrical and Electronic Engineering (hereinafter "TEEE") is published 6 times per year as an official journal of the Institute of Electrical Engineers of Japan (hereinafter "IEEJ"). This peer-reviewed journal contains original research papers and review articles on the most important and latest technological advances in core areas of Electrical and Electronic Engineering and in related disciplines. The journal also publishes short communications reporting on the results of the latest research activities. TEEE aims to provide a new forum for IEEJ members in Japan as well as fellow researchers in Electrical and Electronic Engineering from around the world to exchange ideas and research findings.