{"title":"Content-based RSS and broadcast news streams aggregation and retrieval","authors":"A. Messina, M. Montagnuolo","doi":"10.1109/ICDIM.2008.4746702","DOIUrl":null,"url":null,"abstract":"Nowadays, the global diffusion of the Internet is enabling the distribution of informative content through dynamic media such as RSS feeds and blogs. At the same time, the decreasing cost of electronic devices is increasing the pervasive availability of the same informative content in the form of digital audiovisual items. Hence, efficient and low-cost solutions for semantic aggregation of heterogeneous multimedia content are needed, in order to enable users to retrieve the desired information from this variety of channels. In this paper, we present an unsupervised framework for content-based Web newspaper articles and broadcast news stories aggregation and retrieval. The core of our system is based on a hybrid clustering algorithm that uses RSS feeds and television newscasts programmes as information sources and builds up a multimodal service made up of items including both contributions. Extensive experiments prove the effectiveness of our method.","PeriodicalId":415013,"journal":{"name":"2008 Third International Conference on Digital Information Management","volume":"293 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Third International Conference on Digital Information Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2008.4746702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Nowadays, the global diffusion of the Internet is enabling the distribution of informative content through dynamic media such as RSS feeds and blogs. At the same time, the decreasing cost of electronic devices is increasing the pervasive availability of the same informative content in the form of digital audiovisual items. Hence, efficient and low-cost solutions for semantic aggregation of heterogeneous multimedia content are needed, in order to enable users to retrieve the desired information from this variety of channels. In this paper, we present an unsupervised framework for content-based Web newspaper articles and broadcast news stories aggregation and retrieval. The core of our system is based on a hybrid clustering algorithm that uses RSS feeds and television newscasts programmes as information sources and builds up a multimodal service made up of items including both contributions. Extensive experiments prove the effectiveness of our method.