Quantification of time in Digital Libraries: Temporal Zipf's law

S. Rizzo, D. Montesi
{"title":"Quantification of time in Digital Libraries: Temporal Zipf's law","authors":"S. Rizzo, D. Montesi","doi":"10.1145/3105831.3105866","DOIUrl":null,"url":null,"abstract":"The temporal dimension of a text document defines the temporal scope of its narrated event. This temporal dimension acquires more importance in corpora created along several years of production, such as digital libraries. Temporal aspects of text have been the subject of many researches with specific tasks, notably information retrieval and event detection, while no studies have been conducted to quantify and analyze the richness of the temporal dimension of different text collections. Analysing thirteen text collections we show how the extent and characteristics of the time presence in text varies among collections that have different scopes, although time intervals are mentioned in almost all the text units analyzed. We found that unique intervals follow the same distribution, given by the Zipf's law, that holds for single words.","PeriodicalId":319729,"journal":{"name":"Proceedings of the 21st International Database Engineering & Applications Symposium","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st International Database Engineering & Applications Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3105831.3105866","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The temporal dimension of a text document defines the temporal scope of its narrated event. This temporal dimension acquires more importance in corpora created along several years of production, such as digital libraries. Temporal aspects of text have been the subject of many researches with specific tasks, notably information retrieval and event detection, while no studies have been conducted to quantify and analyze the richness of the temporal dimension of different text collections. Analysing thirteen text collections we show how the extent and characteristics of the time presence in text varies among collections that have different scopes, although time intervals are mentioned in almost all the text units analyzed. We found that unique intervals follow the same distribution, given by the Zipf's law, that holds for single words.
数字图书馆中时间的量化:时间齐夫定律
文本文档的时间维度定义了其叙述事件的时间范围。这种时间维度在经过数年生产的语料库中变得更加重要,比如数字图书馆。文本的时间维度一直是许多具有特定任务的研究的主题,特别是信息检索和事件检测,而没有研究对不同文本集合的时间维度的丰富度进行量化和分析。通过对13个文本集的分析,我们展示了文本中时间存在的程度和特征在具有不同范围的集合中是如何变化的,尽管时间间隔在几乎所有被分析的文本单元中都被提到。我们发现唯一间隔遵循相同的分布,由齐夫定律给出,适用于单个单词。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信