面向说话人识别的多模态数据库系统模型

J. Balcerek, I. Chmielewska
{"title":"面向说话人识别的多模态数据库系统模型","authors":"J. Balcerek, I. Chmielewska","doi":"10.1109/SPA.2007.5903316","DOIUrl":null,"url":null,"abstract":"In the paper, the multimodal database system model designed to store the audio, video, graphical and text file records for speaker identification is proposed. The opportunity to make the records accessible on-line via Web in the client-server architecture using an open source DBMS is provided, and the access involves sharing, loading, modifying or completing the records depending on the user's authorization level. Due to the modular structure and the categorization concept, the scalability of the system along the macro axis (purpose-oriented component databases) and micro axes (number of speakers/recording conditions, language, etc.) has been obtained. A specific solution has been proposed regarding a need for dealing with great volume files.","PeriodicalId":274617,"journal":{"name":"Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speaker identification - oriented multimodal database system model\",\"authors\":\"J. Balcerek, I. Chmielewska\",\"doi\":\"10.1109/SPA.2007.5903316\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the paper, the multimodal database system model designed to store the audio, video, graphical and text file records for speaker identification is proposed. The opportunity to make the records accessible on-line via Web in the client-server architecture using an open source DBMS is provided, and the access involves sharing, loading, modifying or completing the records depending on the user's authorization level. Due to the modular structure and the categorization concept, the scalability of the system along the macro axis (purpose-oriented component databases) and micro axes (number of speakers/recording conditions, language, etc.) has been obtained. A specific solution has been proposed regarding a need for dealing with great volume files.\",\"PeriodicalId\":274617,\"journal\":{\"name\":\"Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007\",\"volume\":\"100 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPA.2007.5903316\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPA.2007.5903316","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种多模态数据库系统模型,用于存储说话人身份识别的音频、视频、图形和文本文件记录。提供了使用开源DBMS通过Web在客户机-服务器体系结构中在线访问记录的机会,访问包括根据用户的授权级别共享、加载、修改或完成记录。由于模块化结构和分类概念,系统沿宏观轴(面向目的的组件数据库)和微观轴(说话人数量/录音条件、语言等)具有可扩展性。针对处理大容量文件的需要,提出了一个具体的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Speaker identification - oriented multimodal database system model
In the paper, the multimodal database system model designed to store the audio, video, graphical and text file records for speaker identification is proposed. The opportunity to make the records accessible on-line via Web in the client-server architecture using an open source DBMS is provided, and the access involves sharing, loading, modifying or completing the records depending on the user's authorization level. Due to the modular structure and the categorization concept, the scalability of the system along the macro axis (purpose-oriented component databases) and micro axes (number of speakers/recording conditions, language, etc.) has been obtained. A specific solution has been proposed regarding a need for dealing with great volume files.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信