{"title":"Quantification of time in Digital Libraries: Temporal Zipf's law","authors":"S. Rizzo, D. Montesi","doi":"10.1145/3105831.3105866","DOIUrl":null,"url":null,"abstract":"The temporal dimension of a text document defines the temporal scope of its narrated event. This temporal dimension acquires more importance in corpora created along several years of production, such as digital libraries. Temporal aspects of text have been the subject of many researches with specific tasks, notably information retrieval and event detection, while no studies have been conducted to quantify and analyze the richness of the temporal dimension of different text collections. Analysing thirteen text collections we show how the extent and characteristics of the time presence in text varies among collections that have different scopes, although time intervals are mentioned in almost all the text units analyzed. We found that unique intervals follow the same distribution, given by the Zipf's law, that holds for single words.","PeriodicalId":319729,"journal":{"name":"Proceedings of the 21st International Database Engineering & Applications Symposium","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st International Database Engineering & Applications Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3105831.3105866","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The temporal dimension of a text document defines the temporal scope of its narrated event. This temporal dimension acquires more importance in corpora created along several years of production, such as digital libraries. Temporal aspects of text have been the subject of many researches with specific tasks, notably information retrieval and event detection, while no studies have been conducted to quantify and analyze the richness of the temporal dimension of different text collections. Analysing thirteen text collections we show how the extent and characteristics of the time presence in text varies among collections that have different scopes, although time intervals are mentioned in almost all the text units analyzed. We found that unique intervals follow the same distribution, given by the Zipf's law, that holds for single words.