Cell Block HTML: Towards Spreadsheet-based Text-Mining for the Masses
B. Wheeler, D. Bainbridge
2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2021-09-01
DOI: https://doi.org/10.1109/JCDL52503.2021.00041
This article details a technical advancement in the ability of spreadsheets to natively handle forms of rich text, such as HTML. We establish the context of the work and specify the criteria we needed to meet so that extending spreadsheet computation to sophisticated forms of text analysis, comparable to numeric calculation, remained within the purview of regular users. Implementation details are provided, along with an example that applies an LDA-based text-mining technique to perform topic modeling.
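To make the idea of computing over HTML-valued cells concrete, the following is a minimal sketch of a preprocessing step that typically precedes topic modeling: stripping markup from each cell and counting terms across a column. It is not the authors' implementation; the helper names `cell_text` and `term_counts` and the sample column are assumptions introduced only for illustration, and the LDA step itself is omitted.

```python
from collections import Counter
from html.parser import HTMLParser
import re


class _TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML fragment, ignoring the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def cell_text(html_cell):
    """Return the plain text of one HTML-valued cell (hypothetical helper).

    Text nodes are joined with a space, so markup placed in the middle of a
    word would split that word; this is a simplification for the sketch.
    """
    extractor = _TextExtractor()
    extractor.feed(html_cell)
    extractor.close()
    # Collapse runs of whitespace left behind by the removed tags.
    return " ".join(" ".join(extractor.parts).split())


def term_counts(html_cells):
    """Count lowercase alphabetic tokens across a column of HTML cells."""
    counts = Counter()
    for cell in html_cells:
        counts.update(re.findall(r"[a-z]+", cell_text(cell).lower()))
    return counts


# Hypothetical column of cells, each holding an HTML fragment.
column = [
    "<p>Spreadsheets handle <b>numbers</b> well.</p>",
    "<p>Rich <i>text</i> in spreadsheets needs more care.</p>",
]
counts = term_counts(column)
```

Term counts of this kind form the bag-of-words input that LDA-style topic models generally expect; a spreadsheet function built on such a step would let a formula refer to a range of rich-text cells much as it refers to a range of numbers.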