{"title":"Exploratory analysis of large web datasets","authors":"S. Castano, A. Ferrara, S. Montanelli","doi":"10.1109/RTSI.2015.7325105","DOIUrl":null,"url":null,"abstract":"In the era of big data, the capability to identify very quickly prominent summary information about a target entity of interest, like a person or an event, from large datasets is essential, and exploratory analysis techniques help in this direction. In this paper, we provide a solution based on smart entity views and on pre-defined analysis operators which exploit keywords available in the entity view together with similarity information to produce summary information about the view contents from both a thematic and analytics perspective. In particular, smart entity views can be analyzed according to the following exploratory paradigms: entity expansion, entity visualization, and entity analytics. The proposed approach is discussed by referring to a case study of twitter dataset related to the “Expo2015” event as target entity.","PeriodicalId":187166,"journal":{"name":"2015 IEEE 1st International Forum on Research and Technologies for Society and Industry Leveraging a better tomorrow (RTSI)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 1st International Forum on Research and Technologies for Society and Industry Leveraging a better tomorrow (RTSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RTSI.2015.7325105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In the era of big data, the capability to identify very quickly prominent summary information about a target entity of interest, like a person or an event, from large datasets is essential, and exploratory analysis techniques help in this direction. In this paper, we provide a solution based on smart entity views and on pre-defined analysis operators which exploit keywords available in the entity view together with similarity information to produce summary information about the view contents from both a thematic and analytics perspective. In particular, smart entity views can be analyzed according to the following exploratory paradigms: entity expansion, entity visualization, and entity analytics. The proposed approach is discussed by referring to a case study of twitter dataset related to the “Expo2015” event as target entity.