Cell Block HTML: Towards Spreadsheet-based Text-Mining for the Masses

2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL) Pub Date : 2021-09-01 DOI:10.1109/JCDL52503.2021.00041

B. Wheeler, D. Bainbridge

引用次数: 0

Abstract

This article details a technical advancement in the core ability of spreadsheets to be able to natively handle forms of rich text, such as HTML. We establish the context to the work, and specify the criteria we needed to meet so that the expansion of spreadsheet computation to handle sophisticated forms of text analysis-comparable to that of numeric calculation-remained within the purview of regular users. Implementation details are provided, along with an example illustrating the application of a LDA-based text-mining technique to perform topic modeling.

查看原文本刊更多论文

单元格块HTML:面向大众的基于电子表格的文本挖掘

本文详细介绍了电子表格核心能力的技术进步，使其能够本地处理富文本的形式，如HTML。我们建立工作的上下文，并指定我们需要满足的标准，以便扩展电子表格计算来处理复杂形式的文本分析——与数字计算相当——仍然在普通用户的范围内。本文提供了实现细节，并提供了一个示例，说明了如何应用基于lda的文本挖掘技术来执行主题建模。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)

自引率

0.00%

发文量