Speaker identification-oriented multimodal database system model
J. Balcerek, I. Chmielewska. Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2007. Published 2007-09-01. DOI: 10.1109/SPA.2007.5903316 (https://doi.org/10.1109/SPA.2007.5903316)
This paper proposes a multimodal database system model designed to store audio, video, graphical, and text file records for speaker identification. The records are made accessible online via the Web in a client-server architecture built on an open-source DBMS; depending on the user's authorization level, access covers sharing, loading, modifying, or completing the records. The modular structure and the categorization concept make the system scalable along the macro axis (purpose-oriented component databases) and the micro axes (number of speakers/recording conditions, language, etc.). A specific solution is also proposed for handling very large files.
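
As an illustration of access that depends on the user's authorization level, the following minimal sketch gates the four operations named in the abstract (sharing, loading, modifying, completing). The level names (`GUEST`, `CONTRIBUTOR`, `ADMIN`) and the mapping from operation to required level are assumptions made for this example only; the abstract does not specify them, and this is not the authors' implementation.

```python
from enum import IntEnum


class Level(IntEnum):
    """Hypothetical authorization levels; the paper's actual levels are not given."""
    GUEST = 0
    CONTRIBUTOR = 1
    ADMIN = 2


# Assumed mapping from each operation to the minimum level that may perform it.
REQUIRED_LEVEL = {
    "share": Level.GUEST,
    "load": Level.CONTRIBUTOR,
    "complete": Level.CONTRIBUTOR,
    "modify": Level.ADMIN,
}


def is_allowed(user_level: Level, operation: str) -> bool:
    """Return True if a user at user_level may perform the operation."""
    return user_level >= REQUIRED_LEVEL[operation]


def allowed_operations(user_level: Level) -> list[str]:
    """List, in alphabetical order, the operations open to user_level."""
    return sorted(op for op, need in REQUIRED_LEVEL.items() if user_level >= need)
```

Under these assumed levels, `allowed_operations(Level.CONTRIBUTOR)` would include sharing, loading, and completing records but not modifying them.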