{"title":"Text Extraction and Clustering for Multimedia: A review on Techniques and Challenges","authors":"Zaheeruddin Ahmed, Harvir Singh","doi":"10.1109/ICD47981.2019.9105905","DOIUrl":null,"url":null,"abstract":"The internet technologies have developed rapidly over recent times producing massive sets of multimedia data where text, images, audios, and videos delivering a huge set of content. The text ingrained in this multimedia is generated from the web and social media that carry complex and meaningful data. There is an increasing need to recognize and extract text from multimedia data which is in unstructured form. Many new techniques have been applied to address the need for text extraction for multimedia but not all have been efficient. One important application of text analysis is to extract text information and then recognize meaningful data visualization for better decisions. This paper focuses on addressing significant text extraction and clustering structures, techniques and challenges from multimedia data set. We will highlight different approaches to text extraction and clustering from multimedia content.","PeriodicalId":277894,"journal":{"name":"2019 International Conference on Digitization (ICD)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Digitization (ICD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICD47981.2019.9105905","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The internet technologies have developed rapidly over recent times producing massive sets of multimedia data where text, images, audios, and videos delivering a huge set of content. The text ingrained in this multimedia is generated from the web and social media that carry complex and meaningful data. There is an increasing need to recognize and extract text from multimedia data which is in unstructured form. Many new techniques have been applied to address the need for text extraction for multimedia but not all have been efficient. One important application of text analysis is to extract text information and then recognize meaningful data visualization for better decisions. This paper focuses on addressing significant text extraction and clustering structures, techniques and challenges from multimedia data set. We will highlight different approaches to text extraction and clustering from multimedia content.