{"title":"Cell Block HTML: Towards Spreadsheet-based Text-Mining for the Masses","authors":"B. Wheeler, D. Bainbridge","doi":"10.1109/JCDL52503.2021.00041","DOIUrl":null,"url":null,"abstract":"This article details a technical advancement in the core ability of spreadsheets to be able to natively handle forms of rich text, such as HTML. We establish the context to the work, and specify the criteria we needed to meet so that the expansion of spreadsheet computation to handle sophisticated forms of text analysis-comparable to that of numeric calculation-remained within the purview of regular users. Implementation details are provided, along with an example illustrating the application of a LDA-based text-mining technique to perform topic modeling.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCDL52503.2021.00041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This article details a technical advancement in the core ability of spreadsheets to be able to natively handle forms of rich text, such as HTML. We establish the context to the work, and specify the criteria we needed to meet so that the expansion of spreadsheet computation to handle sophisticated forms of text analysis-comparable to that of numeric calculation-remained within the purview of regular users. Implementation details are provided, along with an example illustrating the application of a LDA-based text-mining technique to perform topic modeling.