面向说话人识别的多模态数据库系统模型

Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007 Pub Date : 2007-09-01 DOI:10.1109/SPA.2007.5903316

J. Balcerek, I. Chmielewska

{"title":"面向说话人识别的多模态数据库系统模型","authors":"J. Balcerek, I. Chmielewska","doi":"10.1109/SPA.2007.5903316","DOIUrl":null,"url":null,"abstract":"In the paper, the multimodal database system model designed to store the audio, video, graphical and text file records for speaker identification is proposed. The opportunity to make the records accessible on-line via Web in the client-server architecture using an open source DBMS is provided, and the access involves sharing, loading, modifying or completing the records depending on the user's authorization level. Due to the modular structure and the categorization concept, the scalability of the system along the macro axis (purpose-oriented component databases) and micro axes (number of speakers/recording conditions, language, etc.) has been obtained. A specific solution has been proposed regarding a need for dealing with great volume files.","PeriodicalId":274617,"journal":{"name":"Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speaker identification - oriented multimodal database system model\",\"authors\":\"J. Balcerek, I. Chmielewska\",\"doi\":\"10.1109/SPA.2007.5903316\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the paper, the multimodal database system model designed to store the audio, video, graphical and text file records for speaker identification is proposed. The opportunity to make the records accessible on-line via Web in the client-server architecture using an open source DBMS is provided, and the access involves sharing, loading, modifying or completing the records depending on the user's authorization level. Due to the modular structure and the categorization concept, the scalability of the system along the macro axis (purpose-oriented component databases) and micro axes (number of speakers/recording conditions, language, etc.) has been obtained. A specific solution has been proposed regarding a need for dealing with great volume files.\",\"PeriodicalId\":274617,\"journal\":{\"name\":\"Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007\",\"volume\":\"100 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPA.2007.5903316\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPA.2007.5903316","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种多模态数据库系统模型，用于存储说话人身份识别的音频、视频、图形和文本文件记录。提供了使用开源DBMS通过Web在客户机-服务器体系结构中在线访问记录的机会，访问包括根据用户的授权级别共享、加载、修改或完成记录。由于模块化结构和分类概念，系统沿宏观轴(面向目的的组件数据库)和微观轴(说话人数量/录音条件、语言等)具有可扩展性。针对处理大容量文件的需要，提出了一个具体的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Speaker identification - oriented multimodal database system model

In the paper, the multimodal database system model designed to store the audio, video, graphical and text file records for speaker identification is proposed. The opportunity to make the records accessible on-line via Web in the client-server architecture using an open source DBMS is provided, and the access involves sharing, loading, modifying or completing the records depending on the user's authorization level. Due to the modular structure and the categorization concept, the scalability of the system along the macro axis (purpose-oriented component databases) and micro axes (number of speakers/recording conditions, language, etc.) has been obtained. A specific solution has been proposed regarding a need for dealing with great volume files.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007

自引率

0.00%

发文量